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(57) Abstract 



A new receptor family has been identified, of activin-like kinases. Novel proteins have activin/TGF-P-type I receptor func- 
tionality, and have consequential diagnostic/therapeutic utility. They may have a serine/threonine kinase domain, a DFKSRN or 
DLKSKN sequence in subdomain VIB and/or a GTKRYM sequence in subdomain VIII. 
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ACTIVIN RECEPTOR-LIKE KINASES, PROTEINS HAVING 
SERINE THREONINE KINASE DOMAINS AND THEIR USE. 

Field of the Invention 

This invention relates to proteins having 
5 serine/threonine kinase domains, corresponding nucleic acid 
molecules, and their use. 
Background of the Invention 

The transforming growth factor-6 (TGF-6) superfamily 
consists of a fzuaily of structxar ally-related proteins, 

10 including three different mammalian isoforms of TGF-B (TGF- 
61, B2 and B3) , activins, inhibins, miillerian-inhibiting 
substance and bone morphogenic proteins (BMPs) (for reviews 
see Roberts and Sporn, (1990) Peptide Growth Factors and 
Their Receptors, Pt.l, Sporn and Roberts, eds. (Berlin: 

15 Springer - Verlag) pp 419-472; Moses et al (1990) Cell 63, 
245-247) . The proteins of the TGF-6 superfamily have a 
wide variety of biological activities. TGF-6 acts as a 
growth inhibitor for many cell types and appears to play a 
central role in the regulation of embryonic development, 

20 tissue regeneration, immuno-regulation, as well as in 
fibrosis and carcinogenesis (Roberts and Sporn (199) see 
above) . 

Activins and inhibins were originally identified as 
factors which regulate secretion of follicle-stimulating 

25 hormone secretion (Vale et aX (1990) Peptide Growth Factors 
and Their Receptors, Pt.2, Sporn and Roberts, eds. (Berlin: 
Springer-Verlag) pp. 211-248). Activins were also shown to 
induce the differentiation of haematopoietic progenitor 
cells (Murata et ai (1988) Proc. Natl. Acad. Sci. USA 85/ 

30 2434 - 2438; Eto et al (1987) Biochem. Biophys. Res. 
Commun. 142, 1095-1103) and induce mesoderm formation in 
Xenopus embryos (Smith et (1990) Nature 345 . 729-731; 
van den Eijnden-Van Raaij et al (1990) Nature 345, 732- 
734) . 

35 BMPs or osteogenic proteins which induce the formation 

of bone and cartilage when implanted subcutaneous ly (Wozney 
e£ al (1988) Science 242, 1528-1534), facilitate neuronal 
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differentiation (Paralkar et al (1992) J. Cell Biol. 119 . 
1721-1728) and induce monocyte chemotaxis (Cunningh£UQ efc 
(1992) Proc. Natl. Acad. Sci. USA fia, 11740-11744), 
MQllerian-inhibiting substance induces regression of the 
5 MUllerian duct in the male reproductive system (Gate e£ 

(1986) Cell 15/ 685-698), and a glial cell line-derived 
nexirotrophic factor enhances survival of midbrain 
dopaminergic neurons (Lin et (1993) Science 260 . 1130- 
1132) - The action of these growth factors is mediated 

10 through binding to specific cell surface receptors. 

Within this feuaily, TGF-B receptors have been most 
thoroughly characterized. By covalently cross-linking 
radio-labelled TGF-fi to cell surface molecules followed by 
polyacrylamide gel electrophoresis of the affinity-labelled 

15 complexes, three distinct size classes of cell surface 
proteins (in most cases) have been identified, denoted 
receptor type I (53 kd) , type II (75 kd) , type III or 
betaglycan (a 300 kd proteoglycan with a 120 kd core 
protein) (for a review see Massague (1992) Cell .69 1067- 

20 1070) and more recently endoglin (a homodimer of two 95 kd 
subunits) (Cheifetz et al (1992) J. Biol. Chem. 267 19027- 
19030) . Current evidence suggests that type I and type II 
receptors are directly involved in receptor signal 
transduction (Segarini et al (1989) Mol. Endo., 3., 261-272; 

25 Laiho et ai (1991) J. Biol. Chem. 266 . 9100-9112) and may 
form a heteromeric complex; the type II receptor is needed 
for the binding of TGF-B to the type I receptor and the 
type I receptor is needed for the signal transduction 
induced by the type II receptor (Wrana et ai (1992) Cell, 

30 71, 1003-1004) . The type III receptor and endoglin may 
have more indirect roles, possibly by facilitating the 
binding of ligand to type II receptors (Hang et ai (1991) 
Cell, 67 797-805; L6pez-Casillas et al (1993) Cell, 73 
1435-1444) . 

35 Binding analyses with activin A and BMP4 have led to 

the identification of two co-existing cross-linked affinity 
complexes of 50-60 kDa and 70-80 kDa on responsive cells 
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(Hino efe ai (1989) J. Bid. Chem. 264 . 10309 - 10314; 
Mathews and Vale (1991), Cell 68, 775-785; Paralker et al 

(1991) Proc. Natl. Acad. Sci. USA 87, 8913-8917) . By 
analogy with TGF-B receptors they are thought to be 

5 signalling receptors and have been named type I and type II 
receptors . 

Among the type II receptors for the TGF-fl superfamily 
of proteins, the cDNA for the activin type II receptor (Act 
RII) was the first to be cloned (Mathews and Vale (1991) 

10 Cell 6^, 973-982) . The predicted structure of the receptor 
was shown to be a transmembrane protein with an 
intracellular serine/threonine kinase domain. The activin 
receptor is related to the C. eleaans daf -1 gene product, 
but the ligand is currently unknown (Georgi et al (1990) 

15 Cell 61, 635-645) . Thereafter, another form of the activin 
type II receptor (activin type IIB receptor) , of which 
there are different splicing variants (Mathews efc al. 

(1992) , Science 225 . 1702-1705; Attisano et, ai (1992) Cell 
68 . 97-108) , and the TGF-B type II receptor (TBRII) (Lin et 

20 ai (1992) Cell eg., 775-785) were cloned, both of which have 
putative serine/threonine kinase domains. 
Summary of the Invention 

The present invention involves the discovery of 
related novel peptides, including peptides having the 

25 activity of those defined herein as SEQ ID Nos. 2, 4, 8, 
10, 12, 14, 16 and 18. Their discovery is based on the 
realisation that receptor serine/threonine kinases form a 
new receptor fcunily, which may include the type II 
receptors for other proteins in the TGF-B superfamily. To 

30 ascertain whether there were other members of this family 
of receptors, a protocol was designed to clone ActRII/ daf 
I related cDNAs. This approach made use of the polymerase 
chain reaction (PCR) , using degenerate primers based upon 
the amino-acid sequence similarity between kinase domains 

35 of the mouse activin type II receptor and daf -I gene 
products . 
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This s-tra-tegy resulted in "the isolation of a new 
fsunily of receptor kinases called Activin receptor l^ike 
Kinases (ALK's) 1-6. These cDNAs showed an overall 33-39% 
sequence similarity with ActRII and TGF-B type II receptor 
5 and 40-92% secpaence similarity towards each other in the 
kinase domains. 

Soluble receptors according to the invention comprise 
at least predominantly the extracellular domain. These can 
be selected from the information provided herein, prepared 
10 in conventional manner, and used in any manner associated 
with the invention. 

Antibodies to the peptides described herein may be 
raised in conventional manner. By selecting unique 
secpiences of the peptides, antibodies having desired 
15 specificity can be obtained. 

The antibodies may be monoclonal, prepared in ]cnown 
manner. In particular, monoclonal antibodies to the 
extracellular domain are of potential value in therapy. 

Products of the invention are useful in diagnostic 
20 methods, e.g. to determine the presence in a sample for an 
analyte binding therewith, such as in an antagonist assay. 
Conventional techniques, e.g. an enzyme-linked 
immunosorbent assay, may be used. 

Products of the invention having a specific receptor 
25 activity can be used in therapy, e.g. to modulate 
conditions associated with activin or TGF-P activity. Such 
conditions include fibrosis, e.g. liver cirrhosis and 
pulmonary fibrosis, cancer, rheumatoid arthritis and 
glomeronephr itis . 
30 Brief Description of the Drawings 

Figure 1 shows the alignment of the serine/threonine 
(S/T) kinase domains (I-VIII) of related receptors from 
transmembrane proteins , including embodiments of the 
present invention. The nomenclature of the subdomains is 
35 accordingly to Hanks et al (1988) . 

Figures 2 A to 2D shows the sequences and 
characteristics of the respective primers used in the 
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initial PGR reactions. The nucleic acid seq[uences are also 
given as SEQ ID Nos. 19 to 22. 

Figvire 3 is a comparison of the amino-acid sequences 
of human activin type II receptor (Act R-II) , mouse activin 
5 type IIB receptor (Act R-IIB) , human TGF-6 type II receptor 
(TBR-II) , human TGF-6 type I receptor (AIiK-5) , human 
activin receptor type lA ( AI.K-2 ) , and type IB (AIiK-4 ) , ALKs 
1 & 3 and mouse ALK-G. 

Figure 4 shows, schematically, the stiructures for Daf - 
10 X, Act R-II, Act R-IIB, TBR-II, TBR-I/ALK-5, AUC's -1, -2 
(Act RIA) , -3, -4 (Act RIB) & -6. 

Figure 5 shows the sequence alignment of the cysteine- 
rich domains of the AUCs, TBR-II, Act R-II, Act R-IIB and 
daf -1 receptors. 
15 Figture 6 is a comparison of kinase domains of 

serine/ threonine kinases, showing the percentage amino-acid 
identity of the kinase domains. 

Figure 7 shows the pairwise alignment relationship 
between the kinase domains of the receptor serine/ threonine 
20 kinases. The dendrogxam was generated using the Jotun-Hein 
alignment program (Hein (1990) Meth. Enzymol. 183 . 626- 
645) . 

Brief Description of the Secmence Listings 

Sequences 1 and 2 are the nucleotide and deduced 
25 amino-acid sequences of cDNA for hALK-1 (clone HP57) . 

Sequences 3 and 4 are the nucleotide and deduced 
amino-acid sequences of cDNA for hALK-2 (clone HP53) . 

Sequences 5 and 6 are the nucleotide and deduced 
amino-acid sequences of cDNA for hAIiK-3 (clone ONF5) . 
30 Sequences 7 and 8 the nucleotide and deduced amino- 

acid sequences of cDNA for hALK-4 (clone 11H8) , 
complemented with PGR product encoding extracellular 
domain . 

Sequences 9 and 10 are the nucleotide and deduced 
35 amino-acid sequences of cDNA for hAIjK-5 (clone EMBLA) . 

Sequences 11 and 12 are the nucleotide and deduced 
eunino-acid sequences of cDNA for mALK-1 (clone AM6) . 
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Sec[uences 13 and 14 are "the nucleotide and deduced 
amino-acid sequences of cDNA for inALK-3 (clones ME-7 and 
ME-D) . 

Sequences 15 and 16 are -the nucleotide and deduced 
5 amino-acid sequences of cDNA for inALK-4 (clone Sal) . 

Sequences 17 and 18 are the nucleotide and deduced 
amino-acid sequences of cDNA for mALK-e (clone HE-6) . 

Sequence 19 (Bl-S) is a sense primer, extracellular 
domain r cysteine-rich region, BeunHX site at 5' end, 28-mer, 
10 64-fold degeneracy. 

Sequence 20 (B3-S) is a sense primer, kinase domain 
II, BamHI site at 5' end, 25-mer, 162-fold degeneracy. 

Sequence 21 (BT-S) is a sense primer, kinase domain 
VIB, S/T kinase specific residues, BamHI site at 5' end, 
15 24-mer, 288-fold degeneracy. 

Sequence 22 (E8-AS) is an anti-sense primer, kinase 
domain, S/T kinase-specif ic residues EcoRI site at 5' end, 
20-mer, 18-fold degeneracy. 

Sequence 23 is an oligonucleotide probe. 
20 Sequence 24 is a 5' primer. 

Sequence 25 is a 2* primer. 

Sequence 26 is a consensus sequence in Subdomain I. 
Sequences 27 and 28 are novel sequence motifs in 
Subdomain VIB. 

25 Sequence 29 is a novel sequence motif in Subdomain 

VIII. 

Description of the Invention 

As described in more detail below, nucleic acid 
sequences have been isolated, coding for a new sub-family 

30 of serine /threonine receptor kinases. The term nucleic 
acid molecules as used herein refers to any sec[uence which 
codes for the murine, human or meunmalian form, eunino-acid 
sequences of which are presented herein. It is understood 
that the well known phenomenon of codon degeneracy provides 

35 for a great deal of sequence variation and all such 
varieties are included within the scope of this invention. 
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The nucleic acid sequences described herein may be 
used t:o clone the respective genomic DNA sequences in order 
to study the genes' structure and regulation. The murine 
and human cDNA or genomic sequences can also be used to 
5 isolate the homologous genes from other mammalian species. 
The mammalian DNA sequences can be used to study the 
receptors' functions in various in vitro and in vivo model 
systems . 

As exemplified below for ALK-5 cDNA, it is also 

10 recognised that, given the sequence information provided 
herein, the artisan could easily combine the molecules with 
a pertinent promoter in a vector, so as to produce a 
cloning vehicle for expression of the molecule. The 
promoter and coding molecule must be operably linked via 

15 any of the well-recognized and easily-practised 
methodologies for so doing. The resulting vectors, as well 
as the isolated nucleic acid molecules themselves, may be 
used to transform prokaryotic cells (e.g. E. coli) , or 
trans feet eukaryotes such as yeast (S. cerevisiae ^ , PAE, 

20 COS or CHO cell lines. Other appropriate expression 
systems will also be apparent to the skilled artisan. 

Several methods may be used to isolate the ligands for 
the ALKs. As shown for AIiK-5 cDNA, cDNA clones encoding 
the active open reading frames can be subcloned into 

25 expression vectors and transfected into eukaryotic cells, 
for excunple COS cells. The transfected cells which can 
express the receptor can be subjected to binding assays for 
radioactively- label led members of the TGF-6 super family 
(TGF-B, activins, inhibins, bone morphogenic proteins and 

30 mttllerian- inhibiting substances) , as it may be expected 
that the receptors will bind members of the TGF-B 
superf cunily. Various biochemical or cell-based assays can 
be designed to identify the ligands, in tissue extracts or 
conditioned media, for receptors in which a ligand is not 

35 known. Antibodies raised to the receptors may also be used 
to identify the ligands, using the immunoprecipitation of 
the cross-linked complexes. Alternatively, purified 
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receptor could be used to isolate the ligands using an 
affinity-based approach. The determination of the 
expression patterns of the receptors may also aid in the 
isolation of the ligand. These studies may be carried out 
5 using ALK DNA or RNA sequences as probes to perf ona in situ 
hybridisation studies. 

The use of various model systems or structural studies 
should enable the rational development of specific agonists 
and antagonists useful in regulating receptor function. It 

10 may be envisaged that these can be peptides, mutated 
ligands, antibodies or other molecules able to interact 
with the receptors. 

The foregoing provides examples of the invention 
Applicants intend to claim which includes, inter alia, 

15 isolated nucleic acid molecules coding for activin 
receptor-like kinases (ALKs) , as defined herein. These 
include such sequences isolated from mammalian species such 
as mouse, human, rat, rabbit and monkey. 

The following description relates to specific 

20 embodiments. It will be understood that the specification 
and exeimples are illustrative but not limitative of the 
present invention and that other embodiments within the 
spirit and scope of the invention will suggest themselves 
to those skilled in the art. 

25 Preparation of mRNA and Construction of a cDNA Librarv 

For construction of a cDNA library, poly (A)* RNA was 
isolated from a human erythro leukemia cell line (HEL 
92.1.7) obtained from the American Type Culture Collection 
(ATCC TIB 180) . These cells were chosen as they have been 

30 shown to respond to both activin and TGP-B. Moreover 
leukaemic cells have proved to be rich sources for the 
cloning of novel receptor tyrosine kinases (Partanen efc al 
(1990) Proc. Natl. Acad. Sci. USA 82, 8913-8917 and (1992) 
Mol. Cell. Biol, la, 1698-1707). (Total) RNA was prepared 

35 by the guanidinium isothiocyanate method (Chirgwin et al 
(1979) Biochemistry 18, 5294-5299). mRNA was selected 
using the poly-A or poly AT tract mRNA isolation kit 
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(Promega, Madison, Wisconsin, U.S.A.) as described by "the 
manufacturers, or piirified through an oligo (dT) -cellulose 
column as described by Aviv and Leder (1972) Proc. Natl. 
Acad. Sci. USA 69, 1408-1412. The isolated mRNA was used 
5 for the synthesis of random primed (Amersham) cDNA, that 
was used to make a A.gtlO library with 1x1 independent 
cDNA clones using the Riboclone cDNA synthesis system 
(Promega) and AgtlO in vitro packaging kit (Amersham) 
according to the manufacturers' procedures. An amplified 

10 oligo (dT) primed human placenta A.ZAPII cDNA library of 
5x1 o' independent clones was used. Poly (A)* RNA isolated 
from AG1518 human foreskin fibroblasts was used to prepare 
a primary random primed AZAPXI cDNA library of 1.5x10* 
independent clones using the RiboClone cDNA synthesis 

15 system and Gigapack Gold II packaging extract (Stratagene) . 
In addition, a primary oligo (dT) primed human foreskin 
fibroblast AgtlO cDNA library (Claesson-Welsh et ai (1989) 
Proc. Natl. Acad. Sci. USA. 86 4917-4912) was prepared. An 
amplified oligo (dT) primed HEL cell JLgtll cDNA library of 

20 1.5 X 10* independent clones (Poncz et sJL (1987) Blood 6S. 
219-223) was used. A twelve-day mouse embryo A.E XI0X cDNA 
library was obtained from Novagen (Madison, Wisconsin, 
U.S.A.); a mouse placenta A.ZAPII cDNA library was also 
used. 

25 Generation of cDNA Probes by PGR 

For the generation of cDNA probes by PGR (Lee et al 
(1988) Science 239 . 1288-1291) degenerate PGR primers were 
constructed based upon the amino-acid sequence similarity 
between the mouse activin type II receptor (Mathews and 

30 Vale (1991) Cell 65, 973-982) and daf -1 (George et al 
(1990) Cell 6i, 635-645) in the kinase domains II and VIII. 
Figure 1 shows the aligned serine/threonine kinase domains 
(I-VIII) , of foiir related receptors of the TGF-B 
superfamily, i.e. hTBR-II, mActR-IIB, mActR-II and the daf- 

35 1 gene product, using the nomenclature of the subdomains 
according to Hanks et ai (1988) Science 241 . 45-52, 
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Several considerations were applied in the design of 
the PGR primers. The sequences were taken from regions of 
homology between the activin type II receptor and the daf -1 
gene product, with particular emphasis on residues that 
5 confer serine/threonine specificity (see Table 2) and on 
residues that are shared by transmembrane kinase proteins 
and not by cytoplasmic kinases. The primers were designed 
so that each primer of a PGR set had an approximately 
similar GC composition, and so that self complementarity 

10 and complementarity between the 3' ends of the primer sets 
were avoided. Degeneracy of the primers was kept as low as 
possible, in particular avoiding serine, leucine and 
arginine residues (6 possible codons) , and human codon 
preference was applied. Degeneracy was particularly 

15 avoided at the 3' end as, unlike the 5' end, where 
mismatches are tolerated, mismatches at the 3' end 
dramatically reduce the efficiency of PGR. 

In order to facilitate directional subcloning, 
restriction enzyme sites were included at the 5' end of the 

20 primers, with a GG clamp, which permits efficient 
restriction enzyme digestion. The primers utilised are 
shown in Figure 2 . Oligonucleotides were synthesized using 
Gene assembler plus (Pharmacia - LKB) according to the 
manufacturers instructions . 

25 ' The mRNA prepared from HEL cells as described above 

was reverse-transcribed into cDNA in the presence of 50 mM 
Tris-HCl, pH 8.3, 8 mM MgGlj, 30 mM KGl, 10 mM 
dithiothreitol , 2mM nucleotide triphosphates, excess oligo 
(dT) primers and 34 \inits of AMV reverse transcriptase at 

30 42°C for 2 hours in 40 /il of reaction volume. 
Amplification by PGR was carried out with a 7.5% aliquot (3 
Ail) of the reverse-transcribed mRNA, in the presence of 10 
mM Tris-HGl, pH 8.3, 50 mM KGl, 1.5 M MgGlg, 0.01% gelatin, 
0.2 mM nucleotide triphosphates, 1 of both sense and 

35 antisense primers and 2.5 units of Taq polymerase (Perkin 
Elmer Cetus) in 100 /il reaction volume. Amplifications 
were performed on a thermal cycler (Perkin Elmer Getus) 
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using the following program: first 5 thermal cycles with 
denaturation for 1 minute at 94°C, annealing for 1 minute 
at SO^C, a 2 minute ramp to 55°C and elongation for 1 minute 
at IT^C, followed by 20 cycles of 1 minute at 94''c, 30 
5 seconds at 55**C and 1 minute at 72''c. A second round of PGR 
was performed with 2 ^il of the first reaction as a 
template. This involved 25 thermal cycles, each composed 
of 94°C (1 min) , 55°C (0.5 min) , 72°C (1 min) . 

General procedures such as purification of nucleic 

10 acids, restriction enzyme digestion, gel electrophoresis, 
transfer of nucleic acid to solid supports and subcloning 
were performed essentially according to established 
procedures as described by Sambrook et al, (1989) , 
Molecular cloning: A Laboratory Manual, 2"^ Ed. Cold Spring 

15 Harbor Laboratory (Cold Spring Harbor, New York, USA) . 

Samples of the PCR products were digested with BamHI 
and EcoRI and subsequently fractionated by low melting 
point agarose gel electrophoresis. Bands corresponding to 
the approximate expected sizes, (see Table 1: »460 bp for 

20 primer pair B3-S and E8-AS and « 140 bp for primer pair B7- 
S and E8-AS) were excised from the gel and the DNA was 
purified. Subsequently, these fragments were ligated into 
pUC19 (Yanisch-Perron et al (1985) Gene 33. 103-119), which 
had been previously linearised with Bam HI and EcoR l and 

25 transformed into coli strain DH5o using standard 

protocols (Sambrook et ai, supra ) . Individual clones were 
sequenced using standard double-stranded sequencing 
techniques and the dideoxynucleotide chain termination 
method as described by Sanger et al (1977) Proc. Natl. 

30 Acad. Sci. USA JA, 5463-5467, and T7 DNA polymerase. 

Employing Reverse Transcriptase PCR on HEL mRNA with 
the primer pair B3-S and E8-AS, three PCR products were 
obtained, termed 11.1, 11.2 and 11.3, that corresponded to 
novel genes. Using the primer pair B7-S and E8-AS, an 

35 additional novel PCR product was obtained termed 5.2. 
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TABLE 1 



1 NAME 
1 OP PCR 
1 PRODUCT 

1 


PRINBRS 


INSERX 
SIZE 

(bp) 


SIZE OP DMA 

PRAOMENT IM 

■ActRII/ 

hTBRZI 

CLONES 

(bp) 


SEQUENCE 
IDEMXXXY HZXB 
SEQUENCE 
■AcUUI/hTBRIZ 
(*) 


SEQUENCE 

IDENTITY 

BETWEEN 

aAc^ZI 

and 

TBR-II 




B3-S/E8-AS 


460 


460 


46/40 


42 


11.2 


B3-S/E8-AS 


460 


460 


49/44 


47 


11.3 


B3-S/E8-AS 


460 


460 


44/36 


48 


11-29 


B3-S/E8-AS 


460 


460 


ND/100 


ND 


9.2 


B1-S/E8-AS 


800 


795 


100/ND 


ND 


5.2 


B7-S/E8-AS 


140 


143 


40/38 


60 



15 Isolation of cDNA Clones 

The PCR products obtained were used to screen various 
cDNA libraries described supra . Labelling of the inserts 
of PCR products was performed using random priming method 
(Feinberg and Vogelstein (1983) Anal. Biochem, 132 6-13) 

20 using the Megaprime DNA labelling system (Amershcun) . The 
oligonucleotide derived from the sequence of the PCR 
product 5.2 was labelled by phosphorylation with T4 
polynucleotide Jcinase following standard protocols 
(Seunbrook et ai, supra ^ . Hybridization and purification of 

25 positive bacteriophages were performed using standard 
molecular biological techniques. 

The double-stranded DNA clones were all sequenced 
using the dideoxynucleotide chain-termination method as 
described by Sanger et aX, supra . using T7 DNA polymerase 

30 (Pharmacia - LKB) or Sequenase (U.S. Biochemical 
Corporation, Cleveland, Ohio, U.S.A.). Compressions of 
nucleotides were resolved using 7-deaza-GTP (U.S. 
Biochemical Corp.) DNA sequences were analyzed using the 
DNA STAR computer progrsoa (DNA STAR Ltd. U.K.). Analyses 

35 of the sequences obtained revealed the existence of six 
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dis-tinc-t putative receptor serine/threonine kinases which 
have been neuoaed ALK 1-6. 

To clone cDNA for ALK-l the oligo (dT) primed human 
placenta cDNA library was screened with a radiolabelled 
5 insert derived from the PGR product 11.3; based upon their 
restriction enzyme digestion patterns, three different 
types of clones with approximate insert sizes, of 1.7 kb, 
2 kb & 3.5 kb were identified. The 2 kb clone, neuned 
HP57, was chosen as representative of this class and 

10 subjected to complete sequencing. Sequence analysis of 
AIiK-1 revealed a sequence of 1984 nucleotides including a 
poly-A tail (SEQ ID No. 1) . The longest open reading frame 
encodes a protein of 503 amino-acids, with high sequence 
similarity to receptor serine/threonine kinases (see 

15 below) . The first methionine codon, the putative 
translation start site, is at nucleotide 283-285 and is 
preceded by an in- frame stop codon. This first ATG is in 
a more favourable context for translation initiation (Kozedc 
(1987) Nucl. Acids Res., 2=5r 8125-8148) than the second and 

20 third in-freuae ATG at nucleotides 316-318 and 325-327. The 
putative initiation codon is preceded by a 5' untranslated 
sequence of 282 nucleotides that is GC-rich (80% GC) , which 
is not uncommon for growth factor receptors (Kozedc (1991) 
J. Cell Biol., 115 . 887-903). The 3' untranslated sequence 

25 comprises 193 nucleotides and ends with a poly-A tail. No 
bona fide poly-A addition signal is found, but there is a 
sequence (AATACA) , 17-22 nucleotides upstream of the poly-A 
tail, which may serve as a poly-A addition signal. 

ALK-2 cDNA was cloned by screening an amplified oligo 

30 (dT) primed human placenta cDNA library with a 
radiolabelled insert derived from the PGR product 11.2. 
Two clones, termed HP53 and HP64, with insert sizes of 2.7 
kb and 2.4 kb respectively, were identified and their 
sequences were determined. No secpience difference in the 

35 overlapping clones was found, suggesting they are both 
derived from transcripts of the seune gene. 
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Sequence analysis of cDNA clone HP53 (SEQ ID No. 3) 
revealed a sequence of 2719 nucleosides with a poly-A tail. 
The longest open reading frame encodes a protein of 509 
amino-acids . The first AT6 at nucleotides 104->106 agrees 
5 favourably with Kozak's consensus sequence with an A at 
position 3. This ATG is preceded in-fraone by a stop codon. 
There are four ATG codons in close proximity further 
downstream, which agree with the Kozak's consensus sequence 
(KozeOc, supra ) , but according to KozeOc's scanning model the 

10 first ATG is predicted to be the translation start site. 
The 5' untranslated sequence is 103 nucleotides. The 3' 
untranslated sequence of 1089 nucleotides contains a 
polyadenylation signal located 9-14 nucleotides upstreeun 
from the poly-A tail. The cDNA clone HP64 lacks 498 

15 nucleotides from the 5 ' end compared to HP53 , but the 
sequence extended at the 3' end with 190 nucleotides and 
poly-A tail is absent. This suggests that different 
polyadenylation sites occiir for AIjK-2. In Northern blots, 
however, only one transcript was detected (see below) . 

20 The cDNA for human ALK-3 was cloned by initially 

screening an oligo (dT) primed human foreskin fibroblast 
cDNA library with an oligonucleotide (SEQ ID No. 23) 
derived from the PGR product 5.2. One positive cDNA clone 
with an insert size of 3 kb, termed ONll, was identified. 

25 However, upon partial sequencing, it appeared that this 
clone was incomplete; it encodes only part of the kinase 
domain and lacks the extracelluar domain. The most 5' 
sequence of ONll, a 540 nucleotide Xba l restriction 
fragment encoding a truncated kinase domain, was 

30 subsequently used to probe a random primed fibroblast cDNA 
library from which one cDNA clone with an insert size of 3 
kb, termed 0KF5, was isolated (SEQ ID No. 5) . Sequence 
analysis of ONF5 revealed a sequence of 2932 nucleotides 
without a poly-A tail, suggesting that this clone was 

35 derived by internal priming. The longest open reading 
frame codes for a protein of 532 amino-acids. The first 
ATG codon which is compatible with Kozsik's consensus 
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sequence (Kozak, supra 1 , is at 310-312 nucleotides and is 
preceded by an in-frame stop codon. The 5' and 3' 
untranslated sequences are 309 and 1027 nucleotides long, 
respectively . 

5 AIjK-4 cDNA was identified by screening a human bligo 

(dT) primed human erythroleukemia cDNA library with the 
radiolabelled insert of the PGR product 11.1 as a probe. 
One cDNA clone, termed 11H8, was identified with an insert 
size of 2 kb (SEQ ID No. 7) . An open reading frame was 

10 found encoding a protein sequence of 383 amino-acids 
encoding a truncated extracellular domain with high 
similarity to receptor serine/threonine kinases. The 3' 
untranslated sequence is 818 nucleotides and does not 
contain a poly-A tail, suggesting that the cDNA was 

15 internally primed. cDNA encoding the complete 

extracellular domain (nucleotides 1-366) was obtained from 
HEL cells by RT-PCR with 5' primer (SEQ ID No. 24) derived 
in part from sequence at translation start site of SKR-2 (a 
cDNA sequence deposited in GenBank data base, accesion 

20 number L10125, that is identical in part to ALK-4) and 3' 
primer (SEQ ID No. 25) derived from 11H8 cDNA clone. 

AIjK-5 was identified by screening the random primed 
HEL cell Xgt 10 cDNA library with the PGR product 11. 1 as 
a probe. This yielded one positive clone termed EMBIA 

25 (insert size of 5.3 kb with 2 internal EcoR I sites). 
Nucleotide sequencing revealed an open reading frame of 
1509 bp, coding for 503 amino-acids. The open reading 
frame was flanked by a 5' untranslated sequence of 76 bp, 
and a 3' untranslated sequence of 3.7 kb which was not 

3 0 completely sequenced. The nucleotide and deduced amino- 
acid sec[uences of ALK-5 are shown in SEQ ID Nos. 9 and 10. 
In the 5' part of the open reading frame, only one ATG 
codon was found; this codon fulfils the rules of 
translation initiation (Kozak, supra ^ . An in-frame stop 

35 codon was found at nucleotides (-54) -(-52) in the 5' 
untranslated region. The predicted ATG start codon is 
followed by a stretch of hydrophobic amino-acid residues 
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which has characteristics of a cleavable signal sequence. 
Therefore, the first ATG codon is likely to be used as a 
translation initiation site. A preferred cleavage site for 
the signal peptidase, according to von Heijne (1986) Nucl. 
5 Acid. Res. 14, 4683-4690, is located between amino-acid 
residues 24 and 25. The calculated molecular mass of the 
primary translated product of the ALK-5 without signal 
sequence is 53,646 Da. 

Screening of the mouse embryo kEX lox cDNA library 

10 using PGR, product 11.1 as a probe yielded 20 positive 
clones. DNAs from the positive clones obtained from this 
library were digested with EcoR I and Hindlll, 
electrophoretically separated on a 1.3% agarose gel and 
transferred to nitrocellulose filters according to 

15 established procedures as described by Sambrook et ai, 
supra . The filters were then hybridized with specific 
probes for human ALK-l (nucleotide 288-670) , AIiK-2 
(nucleotide 1-581) , AIiK-3 (nucleotide 79-824) or AIiK-4 
nucleotide 1178-1967) . Such analyses revealed that a clone 

20 teirmed ME-7 hybridised with the human AIiK-3 probe. 
However, nucleotide sequencing revealed that this clone was 
incomplete, and lacked the 5' part of the translated 
region. Screening the same cDNA library with a probe 
corresponding to the extracelluar domain of human AIjK-3 

25 (nucleotides 79-824) revealed the clone ME-D. This clone 
was isolated and the sequence was analyzed. Although this 
clone was incomplete in the 3' end of the translated 
region, ME-7 and ME-D overlapped and together covered the 
complete sequence of mouse ALK-3 . The predicted amino-acid 

30 sequence of mouse AIiK-3 is very similar to the human 
sequence; only 8 amino-acid residues differ (98% identity; 
see SEQ ID No. 14) and the calculated molecular mass of the 
primary translated product without the putative signal 
sequence is 57,447 Da. 

35 Of the clones obtained from the initial library 

screening with PGR product 11. 1, four clones hybridized to 
the probe corresponding to the conserved kinase domain of 
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AIjK-4 but not to probes from more divergent parts of ALK-l 
to -4 . Analysis of these clones revealed that they have an 
identical sequence which differs from those of ALK-l to -5 
and was termed ALK-6. The longest clone ME6 with a 2.0 kb 
5 insert was completely sequenced yielding a 1952 bp fragment 
consisting of an open reading frame of 1506 bp (502 amino- 
acids) , flanked by a 5' untranslated sequence of 186 bp, 
and a 3' untranslated sequence of 160 bp. The nucleotide 
and predicted amino-acid sequences of mouse ALK-e are shown 

10 in SEQ ID Nos. 17 and 18. No polyadenylation signal was 
found in the 3 ' untranslated region of ME6 , indicating that 
the cDNA was internally primed in the 3' end. Only one ATG 
codon was found in the 5' part of the open reading frcune, 
which fulfils the rules for translation initiation (KozeUc, 

15 supra ^ , and was preceded by an in-f raune stop codon at 
nucleotides 163-165. However, a typical hydrophobic leader 
sequence was not observed at the N terminus of the 
translated region. Since there is no ATG codon and 
putative hydrophobic leader sec^ience, this ATG codon is 

20 likely to be used as a translation initiation site. The 
calculated molecular mass of the primary translated product 
with the putative signal sequence is 55,576 Da. 

Mouse AIjK-1 (clone AM6 with 1.9 kb insert) was 
obtained from the mouse placenta A.ZAPZI cDNA library using 

25 human ALK-l cDNA as a probe (see SEQ ID No. 11) . Mouse 
ALK-4 (clone Sal with 2.3kb insert) was also obtained from 
this library using human ALK-4 cDNA library as a probe (SEQ 
ID No. 15) . 

To summarise, clones HP22, HP57, ONFl, ONF3, ONF4 and 
30 HP29 encode the same gene, ALK-l. Clone AM6 encodes mouse 
ALK-l. HP53, HP64 and HP84 encode the same gene, ALK-2 . 
ONF5 , ONF2 and ONll encode the same gene ALK-3 . ME-7 and 
ME-D encode the mouse counterpart of human ALK-3. 11H8 
encodes a different gene ALK-4 , whilst Sal encodes the 
35 mouse equivalent. EHBLA encodes ALK-5, and ME-6 encodes 
ALK-6. 



wo 94/11502 




PCr/GB93/02367 



18 



The sequence alignment between the 6 ALK genes and 
TBR-II, mActR-II and ActR-IIB is shown in Figiire 3. These 
molecules have a similar domain structure; an N-terminal 
predicted hydrophobic signal sequence (von Heijne (1986) 
5 Nucl. Acids Res. JJ.: 4683-4690) is followed by a relatively 
small extracellular cysteine-rich ligand binding domain, a 
single hydrophobic transmembrane region (Kyte & Doolittle 
(1982) J. Mol. Biol. 157 . 105-132) and a C-terminal 
intracellular portion, which consists almost entirely of a 

10 kinase domain (Figures 3 and 4) . 

The extracelluar domains of these receptors have 
cysteine-rich regions, but they show little sequence 
similarity; for example, less than 20% sec[uence identity is 
found between Daf -1. ActR-II, TBR-II and ALK-S. The ALKs 

15 appear to form a subfamily as they show higher sequence 
similarities (15-47% identity) in their extracellular 
domains. The extracellular domains of ALK-S and AIiK-4 have 
about 29% sequence identity. In addition, AIjK-3 and ALK-6 
share a high degree of sequence similarity in their 

20 extracellular domains (46% identity) . 

The positions of many of the cysteine residues in all 
receptors can be aligned, suggesting that the extracellular 
domains may adopt a similar structviral configuration. See 
Figxire 5 for ALKs-1,-2,-3 &- 5. Each of the ALKs (except 

25 ALK-6) has a potential N-linked glycosylation site, the 
position of which is conserved between ALK-l and AItK-2 , and 
between ALK-3, AIiK-4 and ALK-S (see Figure 4). 

The sequence similarities in the kinase domains 
between daf-l, ActR-II, TBR-II and ALK-5 are approximately 

30 40%, whereas the sec[uence similarity between the ALKs 1 to 
6 is higher (between 59% and 90%; see Figvire 6). Pairwise 
comparison using the Jutun-Hein sequence alignment program 
(Hein (1990) Meth, Enzymol., 183 . 626-645), between all 
feu&ily members, identifies the ALKs as a separate subclass 

35 aunong serine/ threonine kinases (Figure 7) . 

The catalytic domains of kinases can be divided into 
12 subdomains with stretches of conserved amino-acid 
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residues. The key motifs are found in serine/threonine 
kinase receptors suggesting that they are functional 
kinases. The consensus sequence for the binding of ATP 
(Gly-X-Gly-X-X-Gly in subdomain I followed by a Lys residue 
5 further downstream in subdomain II) is found in all the 
AliKs. 

The kinase domains of daf -1 . ActR-II, and ALKs show 
approximately equal sequence similarity with tyrosine and 
serine/threonine protein kinases. However analysis of the 

10 amino-acid sequences in subdomains VI and VIII, which are 
the most useful to distinguish a specificity for 
phosphorylation of tyrosine residues versus 
serine/threonine residues (Hanks et al (1988) Science 241 
42-52) indicates that these kinases are serine/ threonine 

15 kinases ; refer to Table 2 . 
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TABLE 2 



KIHASE 


ST7BDOMAIMS 




VIB 


VIII 


SerineZ-threonine kinase consensus 


DLKPEN 


G (T/S) XX 
(Y/F) X 


Tyrosine kinase consensus 


DLAARN 


XP(I/V) 
(K/R) W 


Act R-II 


DTKSKN 


GTRRYM 


Act R-IIB 


DFKSKN 


GTRRYM 


T6R-II 


DLKSSN 


GTARYM 


ALK-I 


DFKSRN 


GTKRYM 


ALK -2, -3, -4, -5, & -6 


DliKSKN 


GTKRYM 



The sequence motifs DIiKSKN (Subdomain VIB) and GTKRYM 
(Subdomain VIII) , that are found in most of the 
serine/threonine kinase receptors, agree well with the 

15 consensus sequences for all protein serine/ threonine kinase 
receptors in these regions. In addition, these receptors, 
except for ALK-1, do not have a tyrosine residue surrounded 
by acidic residues between subdomains VII and VIII, which 
is common for tyrosine kinases. A unique characteristic of 

20 the members of the ALK serine/ threonine kinase receptor 
family is the presence of two short inserts in the kinase 
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domain between subdomains VIA and VIB and between 
subdomains X and XI. In the intracellular domain, these 
regions, together with the juxtamembrane part and C- 
terminal tail, are the most divergent between family 
5 members (see Figures 3 and 4) . Based on the sequence 
similarity with the type II receptors for TGF-fi and 
activin, the C termini of the kinase domains of AliKs -l to 
-6 are set at Ser-4S5, Ser-501, Ser-527, Gln-500, Gln-'498 
and Ser-497, respectively, 

10 mRNA Expression 

The distribution of ALK-l, -2, -3, -4 was determined 
by Northern blot analysis. A Northern blot filter with 
mRNAs from different human tissues was obtained from 
Clontech (Palo Alto, C.A.). The filters were hybridized 

15 with ^^P-labelled probes at 42**C overnight in 50% 
formaldehyde, 5 x standard saline citrate (SSC; IxSSC is 
5CmM sodium citrate, pH 7.0, 150 mM NaCl) , 0.1% SDS, 50 mM 
sodium phosphate, 5 x Denhardt's solution and 0.1 mg/ml 
salmon sperm DNA. In order to minimize cross- 

20 hybridization, probes were used that did not encode part of 
the kinase domains, but corresponded to the highly diverged 
sequences of either 5' untranslated and ligand-binding 
regions (probes for ALK-l, -2 and -3) or 3 ' untranslated 
sequences (probe for ALK->4) . The probes were labelled by 

25 random priming using the Multiprime (or Mega-prime) DNA 
labelling system and [ot-^^P] dCTP (Feinberg & Vogelstein 
(1983) Anal. Biochem. 132 ; 6-13) . Unincorporated label was 
removed by Sephadex G-25 chromatography. Filters were 
washed at 65°C, twice for 3 0 minutes in 2.5 x SSC, 0.1% SDS 

30 and twice for 30 minutes in 0.3 x SSC, 0.1% SDS before 
being exposed to X-ray film. Stripping of blots was 
performed by incubation at 90-100°C in water for 20 
minutes . 

The AIjK-5 mRNA size and distribution were determined 
35 by Northern blot analysis as above. An EcoR l fragment of 
980bp of the full length ALK-5 cDNA clone, corresponding to 
the C-terminal part of the kinase domain and 3 ' 
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unt:ranslated region (nucleot:ides 1259-2232 in SEQ ID No. 9) 
was used as a probe. The filter was washed twice in 0.5 x 
SSC, 0.1% SDS at SS^C for 15 minutes. 

Using the probe for AIoK-l.. two transcripts of 2.2 and 
5 4.9kb were detected. The ALK-1 expression level varied 
strongly between different tissues, high in placenta and 
lung, moderate in heart, muscle and kidney, and low (to not 
detectable) in brain, liver and pancreas. The relative 
ratios between the two transcripts were similar in most 

10 tissues; in kidney, however, there was relatively more of 
the 4.9 kb transcript. By reprobing the blot with a probe 
for ALK-2 , one transcript of 4.0 kb was detected with a 
ubiquitous expression pattern. Expression was detected in 
every tissue investigated and was highest in placenta and 

15 skeletal muscle. Subsequently the blot was reprobed for 
AIiK-3 . One major transcript of 4.4 kb and a minor 
transcript of 7.9 kb were detected. Expression was high in 
skeletal muscle, in which also an additional minor 
transcript of 10 kb was observed. Moderate levels of AIjK-3 

20 mRNA were detected in heart, placenta, kidney and pancreas, 
and low (to not detectable) expression was found in brain, 
lung and liver. The relative ratios between the different 
transcripts were similar in the tested tissues, the 4.4 kb 
transcript being the predominant one, with the exception 

25 for brain where both transcripts were expressed at a 
similar level. Probing the blot with AIiK-4 indicated the 
presence of a transcript with the estimated size of 5.2 kb 
and revealed an ubiquitous expression pattern. The results 
of Northern blot analysis using the probe for AIiK-5 showed 

30 that a 5.5 kb transcript is expressed in all human tissues 
tested, being most abvmdant in placenta and least abundant 
in brain and heart. 

The distribution of mRNA for mouse ALK-3 and -6 in 
various mouse tissues was also determined by Northern blot 

35 analysis. A multiple mouse tissue blot was obtained from 
Clontech, Palo Alto, California, U.S.A. The filter was 
hybridized as described above with probes for mouse AIiK-3 
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and AIiK-6. The EcoR I- Pst l res-triction fragment, 

corresponding to nucleotides 79-1100 of ALK-S , and the 
Sac l- Hpa l fragment, corresponding to nucleotides 57-720 of 
ALK-6, were used as probes. The filter was washed at 65°C 
5 twice for 30 minutes in 2.5 x SSC, 0.1% SDS and twice for 
30 minutes with 0.3 x SSC, 0.1% SDS and then subjected to 
autoradiography . 

Using the probe for mouse AIjK-3, a l.l kb transcript 
was found only in spleen. By reprobing the blot with the 

10 AIiK-6 specific probe, a transcript of 7.2 kb was found in 
brain and a weeik signal was also seen in lung. No other 
signal was seen in the other tissues tested, i.e. heart, 
liver, skeletal muscle, kidney and testis. 

All detected transcript sizes were different, and thus 

15 no cross-reaction between mRNAs for the different ALKs was 
observed when the specific probes were used. This suggests 
that the multiple transcripts of ALK-l and AIiK-3 are coded 
from the same gene. The mechanism for generation of the 
different transcripts is unknown at present; they may be 

20 formed by alternative mRNA splicing, differential 
polyadenylation, use of different promoters, or by a 
combination of these events. Differences in mRNA splicing 
in the regions coding for the extracellular domains may 
lead to the synthesis of receptors with different 

25 affinities for ligands, as was shown for mActR-IIB 
(Attisano et aX (1992) Cell 68, 97-108) or to the 
production of soluble binding protein. 

The above experiments describe the isolation of 
nucleic acid sequences coding for new feuaily of human 

30 receptor kinases. The cDNA for AIiK-5 was then used to 
determine the encoded protein size and binding properties. 
Properties of t^f* ft^.tcg «--n HA Encoded Proteiyis 

To study the properties of the proteins encoded by the 
different ALK cDNAs, the cDNA for each ALK was subcloned 

35 into a eukaryotic expression vector and transfected into 
various cell types and then subjected to 
immtinoprecipitation using a rabbit antiserum raised against 
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a synthetic peptide corresponding to part of the 
intracellular juxtamembrane region. This region is 
divergent in sequence between the various serine/threonine 
kinase receptors. The following amino-acid residues were 
5 used: 

ALK-l 145-166 
ALK-2 151-172 
ALK-3 181-202 
ALK-4 153-171 

10 ALK-5 158-179 
ALK-6 151-168 

The rabbit antiserum against ALK-5 was designated VPN. 
The peptides were synthesized with an Applied 
Biosystems 430A Peptide Synthesizer using t-butoxycarbonyl 

15 chemistry and purified by reversed-phase high performance 
liquid chromatography. The peptides were coupled to 
keyhole limpet haemocyanin (Calbiochem-Behring) using 
glutar aldehyde, as described by Guillick et al (1985) EMBO 
J. 4, 2869-2877. The coupled peptides were mixed with 

20 Freunds adjuvant and used to immunize rabbits. 
Transient transfection of the AIiK-5 cDNA 

COS-1 cells (American Type Culture Collection) and the 
R mutant of MvlLu cells (for references, see below) were 
cultured in Dulbecco's modified Eagle's medium containing 

25 10% fetal bovine seriim (FBS) and 100 units/ml penicillin 
and 50 ^g 1ml streptomycin in 5% COg atmosphere at 37**C, 
The AIiK-5 cDNA (nucleotides (-76) - 2232) , which includes 
the complete coding region, was cloned in the pSV7d vector 
(Truett et al, (1985) DNA 4, 333-349), and used for 

30 transfection. Transfection into COS-1 cells was performed 
by the calcium phosphate precipitation method (Wigler et al 
(1979) Cell 16, 777-785). Briefly, cells were seeded into 
6-well cell culture plates at a density of 5x10^ 
cells/well, and transfected the following day with 10 /ng of 

35 recombinant plasmid. After overnight incubation, cells 
were washed three times with a buffer containing 25 mM 
Tris-HCl, pH 7.4, 138 mM NaCl, 5 mM KCl, 0.7 mM CaClg, 0.5 
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mM MgClj and 0.6 mM NagHPO^, and then incubated with 
Dulbecco's modified Eagle's medium containing FBS and 
antibiotics. Two days after transf ection, the cells were 
metabolically labelled by incubating the cells for 6 hours 
5 in methionine and cysteine-free MCDB 104 medium with 150 
/iCi/ml of [^^S] -methionine and [^^S] -cysteine (in vivo 
labelling mix; Amersham) . After labelling, the cells were 
washed with 150 mM NaCI, 25 mM Tris-HCl, pH 7.4, and then 
solubilized with a buffer containing 20mM Tris-HCl, pH 7.4, 

10 150 mM NaCl, 10 mM EDTA, 1% Triton X-100, 1% deoxycholate, 
1.5% Trasylol (Bayer) and l mM phenylmethylsulfonylf luoride 
(PMSF; Sigma) . After 15 minutes on ice, the cell lysates 
were pelleted by centrifugation, and the supernatants were 
then incubated with 7 /il of preimmune serum for 1.5 hours 

15 at 4*C. Samples were then given 50 fil of protein A- 
Sepharose (Pharmacia-LKB) slurry (50% packed beads in 150 
mM NaCl, 2 0 mM Tris-HCl, pH 7.4, 0.2% Triton XlOO) and 
incubated for 45 minutes at 4''c. The beads were spun down 
by centrifugation, and the supernatants (1 ml) were then 

20 incubated with either 7 /xl of preimmune serum or the VPN 
antiseriom for 1.5 hours at 4'c. For blocking, 10 ng of 
peptide was added together with the antiserum. Immune 
complexes were then given 50 fil of protein A-Sepharose 
(Pharmacia - LKB) slurry (50% packed beads in 150 mM NaCl, 

25 20mM Tris-HCl, pH 7.4, 0.2% Triton X-100) and incubated for 
45 minutes at 4''c. The beads were spun down and washed 
four times with a washing buffer (20 mM Tris-HCl, pH 7.4, 
500 mM NaCI, 1% Triton X-100, 1% deoxycholate and 0.2% 
SDS) , followed by one wash in distilled water. The immune 

30 complexes were eluted by boiling for 5 minutes in the SDS- 
ssunple buffer (100 mM Tris-HCl, pH 8.8, 0.01% bromophenol 
blue, 3 6% glycerol, 4% SDS) in the presence of 10 mM DTT, 
and analyzed by SDS-gel electrophoresis using 7-15% 
polyacrylamide gels (Blobel and Dobberstein, (1975) J. Cell 

35 Biol. 67, 835-851). Gels were fixed, incubated with 
Amplify (Amersham) for 20 minutes, and subjected to 
f luorography. A component of 53Da was seen. This 
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component was not: seen when preimmune sertim was used, or 
when 10 ^ig blocking peptide was added together with the 
antiserum. Moreover, it was not detectable in samples 
derived from untransf ected COS-1 cells using either 
5 preimmune serum or the antiservim. 
Digestion with Endoalycosidase F 

Samples immunoprecipitated with the VPN antisera 
obtained as described above were incubated with 0.5 U of 
endoglycosidase F (Boehringer Mannheim Biochemica) in a 

10 buffer containing 100 mM sodium phosphate, pH 6.1, 50 mM 
EDTA, 1% Triton X-100, 0.1% SDS and 1% 6-mercaptoethanol at 
37**C for 24 hours. Samples were eluted by boiling for 5 
minutes in the SDS-sample buffer, and analyzed by SDS- 
polyacrylamide gel electrophoresis as described above. 

15 Hydrolysis of N-linked carbohydrates by endoglycosidase F 
shifted the 53 kDa band to 51 kDa. The extracelluar domain 
of AIjK-5 contains one potential acceptor site for N- 
glycosylation and the size of the deglycosylated protein is 
close to the predicted size of the core protein. 

20 Establishment of PAE Cell Lines Expressing ALK-5 

In order to investigate whether the ALK-5 cDNA encodes 
a receptor for TGF-B, porcine aortic endothelial (PAE) 
cells were transfected with an expression vector containing 
the AIjK-5 cDNA, and analyzed for the binding of ^^I-TGF-61. 

25 PAE cells were cultured in Hcim's F-12 medium 

supplemented with 10% FBS and antibiotics (Miyazono et al. , 
(1988) J. Biol. Chem. 263 . 6407-6415). The AIiK-5 cDNA was 
cloned into the cytomegalovirus (CMV) -based expression 
vector pcDNA I/NEO (Invitrogen) , and transfected into PAE 

30 cells by electroporation. After 48 hoiirs, selection was 
initiated by adding Geneticin (G418 sulphate; Gibco - BRL) 
to the culture mediiim at a final concentration of 0.5 mg/ml 
(Westermark et ai. , (1990) Proc. Natl. Acad. Sci. USA 87 . 
128-132) . Several clones were obtained, and after analysis 

35 by immunoprecipitation using the VPN antiserum, one clone 
denoted PAE/TBR-1 was chosen and further analyzed. 
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Tedination of TGF-Bl. Binding and Affinity Crosslinkina 

Recombinant human TGF-Bl was iodinatied using the 
chloramine T method according to Frolik et ai. , (1984) J. 
Biol. Chem. 259 . 10995-11000. Cross-linking experiments 
5 were performed as previously described (Ichijo et al. . 
(1990) Exp. Cell Res. 187 . 263-269). Briefly, cells in 6- 
well plates were washed with binding buffer (phosphate- 
buffered saline containing 0.9 mM CaClg, 0.49 mM MgClg and 
1 mg/ml bovine seriim albumin (BSA) ) , and incubated on ice 

10 in the same buffer with ^^I-TGF-61 in the presence or 
absence of excess unlabelled TGF-fil for 3 hours. Cells 
were washed and cross-linking was done in the binding 
buffer without BSA together with 0.28 mM disuccinimidyl 
suberate (DSS; Pierce Chemical Co.) for 15 minutes on ice. 

15 The cells were harvested by the addition of 1 ml of 
detachment buffer (10 mM Tris-HCl, pH 7.4, 1 mM EDTA, 10% 
glycerol, 0.3 mM PMSF) . The cells were pelleted by 
centrifugation, then resuspended in 50 /il of solubilization 
buffer (125 mM NaCl, 10 mM Tris-HCl, pH 7.4, 1 mM EDTA, 1% 

20 Triton X-100, 0.3 mM PMSF, 1% Trasylol) and incubated for 
40 minutes on ice. Cells were centrifuged again and 
supernatants were subjected to analysis by SDS-gel 
electrophoresis using 4-15% polyacryleunide gels, followed 
by autoradiography. ^^I-TGF-Bl formed a 70 kDa cross- 

25 linked complex in the transfected PAE cells (PAE/TBR-I 
cells) . The size of this complex was very similar to that 
of the TGF-B type I receptor complex observed at lower 
amounts in the untransf ected cells. A concomitant increase 
of 94 kDa TGF-fi type II receptor complex could also be 

30 observed in the PAE/TBR-I cells. Components of 150-190 
kDa, which may represent crosslinked complexes between the 
type I and type II receptors, were also observed in the 
PAE/TBR-I cells. 

In order to determine whether the cross-linked 70 kDa 

35 complex contained the protein encoded by the AIiK-5 cDNA, 
the affinity cross-linking was followed by 
immunoprecipitation using the VPN antiserim. For this, 
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cells in 25 cm flasks were used. The supernatants 
obtained after cross-linking were incubated with 7 nl of 
preimmune sertm or VPN antiserum in the presence or absence 
of 10 of peptide for 1.5h at 4°C. Immune complexes were 
5 then added to 50 /xl of protein A-Sepharose slurry and 
incubated for 45 minutes at 4*'c. The protein A-Sepharose 
beads were washed four times with the washing buffer, once 
with distilled water, and the samples were analyzed by SDS- 
gel electrophoresis using 4-15% polyacryleimide gradient 

10 gels and autoradiography. A 70 kDa cross-linked complex 
was precipitated by the VPN antiserum in PAE/T6R-1 cells, 
and a weaker band of the same size was also seen in the 
untransf ected cells, indicating that the untransf ected PAE 
cells contained a low amount of endogenous ALK-5. The 70 

15 kDa complex was not observed when preimmune serum was used, 
or when immune serum was blocked by 10 ng of peptide. 
Moreover, a coprecipitated 94 kDa component could also be 
observed in the PAE/TBR-I cells. The latter component is 
likely to represent a TGF-fl type II receptor complex, since 

20 an antiserum, termed DRL, which was raised against a 
synthetic peptide from the C-terminal part of the TGF-B 
type II receptor, precipitated a 94 kDa TGF-6 type II 
receptor complex, as well as a 70 kDa type I receptor 
complex from PAE/TBR-I cells. 

25 The carbohydrate contents of ALK-5 and the TGF-6 type 

II receptor were characterized by deglycosylation using 
endog lycos idase F as described above and analyzed by SDS- 
polyacrylamide gel electrophoresis and autoradiography. 
The ALK-5 cross-linked complex shifted from 70 kDa to 66 

30 kDa, whereas that of the type II receptor shifted from 94 
kDa to 82 kDa. The observed larger shift of the type II 
receptor band compared with that of the ALK-5 band is 
consistent with the deglycosylation data of the type I and 
type II receptors on rat liver cells reported previously 

35 (Cheifetz et ai (1988) J. Biol. Chem. 263 . 16984-16991), 
and fits well with the fact that the porcine TGF-fl type II 
receptor has two N-glycosylation sites (Lin et al, (1992) 
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Cell 68, 775-785) , whereas ALK-S has only one (see SEQ ID 
No. 9) . 

Binding of TGF-fll to the type I receptor is known to 
be abolished by transient treatment of the cells with 
5 dithiothreitol (DTT) (Cheifetz and Massague (1991) J. Biol. 
Chem. 266, 20767-20772; Wrana et al (1992) Cell 71, 1003- 
1014) . When analyzed by affinity cross-linking, binding of 

125 

I-TGF-Bl to ALK-5, but not to the type II receptor, was 
completely abolished by DTT treatment of PAE/T6R-1 cells. 

10 Affinity cross-linking followed by immunoprecipitation by 
the VPN antiserum showed that neither the AIjK-5 nor the 
type II receptor complexes was precipitated after DTT 
treatment, indicating that the VPN antiserum reacts only 
with AIiK-5. The data show that the VPN antiserum 

15 recognizes a TGF-B type I receptor, and that the type I and 
type II receptors form a heteromeric complex. 
^^^I-TGF-Bl Binding & Affi nity Crosslinking of Transfected 
COS Cells 

Transient expression plasmids of ALKs -1 to -6 and 

20 TBR-II were generated by subcloning into the pSV7d 
expression vector or into the pcDNA I expression vector 
(Invitrogen) . Transient transfection of COS-l cells and 
iodination of TGF-61 were carried out as described above. 
Crosslinking and immunoprecipitation were performed as 

25 described for PA£ cells above. 

Transfection of cDNAs for ALKs into COS-l cells did 
not show any appreciable binding of ^^I-TGFBl, consistent 
with the observation that type I receptors do not bind TGF- 
fl in the absence of type II receptors. When the TBR-II 

30 cDNA was co-trans fected with cDNAs for the different ALKs, 
type I receptor-like complexes were seen, at different 
levels, in each case. COS-l cells transfected with TBR-II 
and ALK cDNAs were analyzed by affinity crosslinking 
followed by immunoprecipitation using the DRL antisera or 

35 specific antisera against ALKs, Each one of the ALKs bound 
'^I-TGF-fll and was coimmunoprecipitated with the TBR-II 
complex using the DRL antisenua. Comparison of the 
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efficiency of the different ALKs to form heteromeric 
complexes with TBR-II, revealed that ALK-5 formed such 
complexes more efficiently than the other ALKs. The size 
of the crosslinked complex was larger for ALK-3 than for 
5 other ALKs, consistent with its slightly larger size. 
Expression of the ALK Pr otein in Different Cell Types 

Two different approaches were used to elucidate which 
ALK's are physiological type I receptors for TGF-6. 

Firstly, several cell lines were tested for the 

10 expression of the ALK proteins by cross-linking followed by 
immunoprecipitation using the specific antiseras against 
ALKs and the TGF-B type II receptor. The mink lung 
epithelial cell line, MvlLu, is widely used to provide 
target cells for TGF-6 action and is well characterized 

15 regarding TGF-B receptors (Laiho et ai (1990) J. Biol, 
Chem, 265 . 18518-18524; Laiho gt al (1991) J. Biol. Chem. 
266. 9108-9112) . Only the VPN antiserxim efficiently 
precipitated both type I and type II TGF-B receptors in the 
wild type MvlLu cells. The DRL antisenom also precipitated 

20 components with the same size as those precipitated by the 
VPN antiser\im. A mutant cell line (R mutant) which lacks 
the TGF-B type I receptor and does not respond to TGF-B 
(Laiho et al, supra ^ was also investigated by cross-linking 
followed by immunoprecipitation. Consistent with the 

25 results obtained by Laiho et al (1990) , supra the type III 
and type II TGF-B receptor complexes, but not the type I 
receptor complex, were observed by affinity crosslinking. 
Crosslinking followed by immunoprecipatition using the DRL 
antiserum revealed only the type II receptor complex, 

30 whereas neither the type I nor type II receptor complexes 
was seen using the VPN antiserum. When the cells were 
metabolically labelled and subjected to immunoprecipitation 
using the VPN antisenzm, the 53 kDa ALK-5 protein was 
precipitated in both the wild- type and R mutant MvlLu 

35 cells. These results suggest that the type I receptor 
expressed in the R mutant is ALK-S, which has lost the 
affinity for binding to TGF-B after mutation. 
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The type I and type II TGF-6 receptor complexes could 
be precipitated by the VPN and DRL antisera in other cell 
lines^ including hiunan foreskin fibroblasts (AG1518) , human 
lung adenocarcinoma cells (A549) , and human oral squamous 
5 cell carcinoma cells (HSC-2) . Affinity cross-linking 
studies revealed multiple TGF-B type I receptor-like 
complexes of 70-77 kDa in these cells. These components 
were less efficiently competed by excess unlabelled TGF-Bl 
in HSC-2 cells. Moreover, the type II receptor complex was 

10 low or not detectable in A549 and HSC-2 cells. Cross- 
linking followed by immunoprecipitation revealed that the 
VPN antiserum precipitated only the 70 kDa complex among 
the 70-77 kDa components. The DRL antiserum precipitated 
the 94 kDa type II receptor complex as well as the 70 kDa 

15 type I receptor complex in these cells , but not the 
putative type I receptor complexes of slightly larger 
sizes. These results suggest that multiple type I TGF-B 
receptors may exist and that the 70 kDa complex containing 
ALK-5 forms a heteromeric complex with the TGF-6 type II 

20 receptor cloned by Lin et ai (1992) Cell 68, 775-785, more 
efficiently that the other species. In rat 

pheochromocytoma cells (PC12) which have been reported to 
have no TGF-B receptor complexes by affinity cross-linking 
(Massagug et al (1990) Ann. N.Y. Acad. Sci. 593 . 59-72), 

25 neither VPN nor DRL antisera precipitated the TGF-B 
receptor complexes. The antisera against ALKs -1 to -4 and 
AIiK6 did not efficiently immunoprecipitate the crosslinked 
receptor complexes in porcine aortic endothelial (PAE) 
cells or human foreskin fibroblasts. 

30 Next, it was investigated whether ALKs could restore 

responsiveness to TGF-B in the R mutant of MvlLu cells, 
which lack the ligand-binding ability of the TGF-B type I 
receptor but have intact type II receptor. Wild-type MvlLu 
cells and mutant cells were rransfected with ALK cDNA and 

35 were then assayed for the production of plasminogen 
activator inhibitor-1 (PAI-1) which is produced as a result 
of TGF-B receptor activation as described previously by 
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Laiho et al (1991) Mol. Cell Biol. XI., 972-978. Briefly, 
cells were added with or without 10 ng/ml of TGF-Bl for 2 
hours in serum-free MCDB 104 without methionine. 
Thereafter, cultures were labelled with [^^S] methionine (40 
5 |iCi/ml) for 2 hours. The cells were removed by washing on 
ice once in PBS, twice in 10 mM Tris-HCl (pH 8.0), 0.5% 
sodium deoxycholate, 1 mM PMSF, twice in 2 mH Tris-HCl (pH 
8.0), and once in PBS. Extracellular matrix proteins were 
extracted by scraping cells into the SDS-sample buffer 

10 containing DTT, and analyzed by SDS-gel electrophoresis 
followed by fluorography using Amplify. PAI-1 can be 
identified as a characteristic 45kDa band (Laiho e£ ai 
(1991) Mol. Cell Biol. 11, 972-978). Wild-type MvlLu cells 
responded to TGF-B and produced PAI-1, whereas the R mutant 

15 clone did not, even after stimulation by TGF-Bl. Transient 
transfection of the ALK-5 cDNA into the R mutant clone led 
to the production of PAI-1 in response to the stimulation 
by TGF-Bl, indicating that the AUC-S cDNA encodes a 
fimctional TGF-B type I receptor. In contrast, the R 

20 mutant cells that were transfected with other ALKs did not 
produce PAI-1 upon the addition of TGF-Bl. 

Using similar approaches as those described above for 
the identification of TGF-6-binding ALKs, the ability of 
ALKs to bind activin in the presence of ActRII was 

25 examined. COS-l cells were co-transf ected as described 
above. Recombinant human activin A was iodinated using the 
chlorsunine T method (Mathews and Vale (1991) Cell 65, 973- 
982) . Transfected COS-l cells were analysed for binding 
and crosslinking of ^^I-activin A in the presence or 

30 absence of excess unlabelled activin A. The crosslinked 
complexes were subjected to immunoprecipitation using DRL 
antisera or specific ALK antisera. 

All ALKs appear to bind activin A in the presence of 
Act R-II. This is more clearly demonstrated by affinity 

35 cross-linking followed by immunopreciptation. ALK-2 and 
AIjK-4 bound '^I-activin A and were coimmunoprecipitated 
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wi-th ActR-II. Other ALKs also bound I-activin A but with 
a lower efficiency compared to AIiK-2 and ALK-4. 

In order to investigate whether ALKs are physiological 
activin type I receptors, activin responsive cells were 
5 examined for the expression of endogenous activin type I 
receptors. MvlLu cells, as well as the R mutant, express 
both type I and type II receptors for activin, and the R 
mutant cells produce PAI-1 upon the addition of activin A. 
MvlLu cells were labeled with ^^I-activin A, cross-linked 

10 and immunoprecipitated by the antisera against ActR-II or 
AliKs as described above. 

The type I and type II receptor complexes in MvlLu 
cells were immunoprecipitated only by the antisera against 
ALK-2, ALK-4 and ActR-II. Similar results were obtained 

15 using the R mutant cells. PAE cells do not bind activin 
because of the lack of type II receptors for activin, and 
so cells were transfected with a chimeric receptor, to 
enable them to bind activin, as described herein. A 
plasmid (chim A) containing the extracelluar domain and C- 

20 terminal tail of Act R-II (amino-acids -19 to 116 and 465 
to 494, respectively (Mathews and Vale (1991) Cell, 65, 
973-982)) and the kinase domain of TBR-II (amino-acids 160- 
543) (Lin et ai (1992) Cell, 68, 775-785) was constructed 
and transfected into pcDNA/neo (Invitrogen) . PAE cells 

25 were stably transfected with the chim A plasmid by 
electroporation, and cells expressing the chim A protein 
were established as described previously. PAE/ Chim A cells 

• 125 

were then subjected to "l-activin A labelling crosslinking 
and immunoprecipitation as described above. 

30 Similar to MvlLu cells, activin type I receptor 

complexes in PAE/ Chim A cells were immunoprecipitated by 
the ALK-2 and ALK-4 antisera. These results show that both 
ALK-2 and AIiK-4 serve as high affinity type I receptors for 
activin A in these cells. 

35 ALK-1, ALK-3 and ALK-6 bind TGF-Bl and activin A in 

the presence of their respective type II receptors, but the 
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functional consequences of the binding of the ligands 
remains to be elucidated. 

The invention has been described by way of example 
only, without restriction of its scope. The invention is 
5 defined by the subject matter herein, including the claims 
that follow the immediately following full Sequence 
Listings . 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT t 

(A) NAMEt Ludwlg Institute for Cancer Research 

(B) STREET: St. Mary's Hospital Medical School, Norfolk 

Place 

(C) CITY: Paddington, London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): W2 IPG 

(11) TITLE OF INVENTION: PROTEINS HAVING SERINE /THREONINE KINASE 
DOMAINS, CORRESPONDING NUCLEIC ACID MOLECULES, AND THEIR 
USB 

(ill) NUMBER OF SEQUENCES: 29 

(Iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /M$-OOS 

(D) SOFTWARE: Patentin Release #1.0, Version /I. 25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(ill) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 283.. 1791 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AGGAAACGGT TTATTAGCAC GCAGTGGTGG AGCTGGGCCA GGCAGGAAGA OSCTCCAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCCC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 



SUBSTTTUTE SHEET 
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AGGCTAGCCC CCCCCCACCC GCAGACCGGG CCCAGAGGGA CC ATG ACC TTG GGC 294 

Met Thr Leu Gly 

1 

TCC CCC ACC AAA CCC CTT CTC ATG CTC CTG ATG GCC TTG CTG ACC CAG 342 
Ser Pro Arg Lye Gly Leu Leu Met Leu Leu Met Ala Leu Val Thr Gin 
5 10 15 20 

GCA GAC COT CTG AAG CCC TCT CCC GCC CCC CTG CTC ACC TCC ACC TCT 390 
Gly ASP Pro Val Lys Pro Ser Arg Gly Pro Leu Val Thr Cys Thr Cya 
25 30 35 

GAG AGC CCA CAT TCC AAG CGG CCT ACC TCC CCC CCC CCC TCC TCC ACA 438 
Glu Ser Pro His Cya Lya Gly Pro Thr Cya Arg Gly Ala Trp Cya Thr 
40 45 50 

GTA GTG CTC CTG CGG GAG GAG CGG AGC CAC CCC CAG CAA CAT CGG GGC 486 

Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gin Glu His Arg Gly 
55 60 65 

TCC CGG AAC TTG CAC AGG CAG CTC TCC AGG GGC CCC CCC ACC GAG TTC 534 
Cya Gly Aen Leu His Arg Glu Leu Cya Arg Gly Arg Pro Thr Glu Phe 
70 75 80 

GTC AAC CAC TAC TCC TCC GAC AGC CAC CTC TGC AAC CAC AAC GTG TCC 582 
Val Asn Hia Tyr Cys Cys Asp Ser Hie Leu Cys Asn His Asn Val Ser 
85 90 95 100 

CTG GTG CTG GAG GCC ACC CAA CCT CCT TCG GAG CAG CCC GGA ACA GAT 630 
Leu Val Leu Glu Ala Thr Gin Pro Pro Ser Glu Gin Pro Gly Thr Asp 
lOS 110 lis 

GGC CAG CTG GCC CTG ATC CTG GGC CCC GTG CTG CCC TTG CTG GCC CTG 678 
Gly Gin Leu Ala Leu lie Leu Gly Pro Val Leu Ala Leu Leu Ala Leu 
120 125 130 

GTG GCC CTG CGT GTC CTG GGC CTG TGC CAT GTC CCA CGG AGG CAG CAG 726 
Val Ala Leu Gly Val Leu Gly Leu Trp Hia Val Arg Arg Arg Gin Glu 
135 140 145 

AAG CAG CGT GGC CTC CAC AGC GAG CTC GGA CAG TCC ACT CTC ATC CTC 774 
Lya Gin Arg Gly Leu His Ser Glu Leu Gly Glu Ser Ser Leu lie Leu 
150 155 160 

AAA GCA TCT CAC CAG GGC GAC ACC ATG TTG CCC CAC CTC CTG GAC ACT 822 
Lys Ala Ser Glu Gin Gly Asp Thr Met Leu Gly Asp Leu Leu Asp Ser 
165 170 175 180 

GAC TGC ACC ACA GGC AGT GGC TCA CGG CTC CCC TTC CTG GTG CAG AGG 870 
Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu Val Gin Arg 
185 190 195 

ACA GTG GCA CGG CAG GTT GCC TTG GTG GAG TCT GTG GGA AAA GGC CGC 918 
Thr Val Ala Arg Gin Val Ala Leu Val Glu Cys Val Gly Lys Gly Arg 
200 205 210 

TAT GGC CAA GTG TGG CGG GGC TTG TGG CAC GGT GAG AGT GTG GCC GTC 966 
Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu Ser val Ala Val 
215 220 225 
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AAC ATC TTC TCC TCC AGO GAT CAA GAG TCC TGO TTC CGG GAG ACT GAG 1014 
Lys lie Phe Ser Ser Arg Asp Glu Gin Ser Xrp Phe Arg Glu Thr Glu 
230 235 240 

ATC TAT AAC ACA GTA TTG CTC AGA CAC GAC AAC ATC CTA GGC TTC ATC 1062 
lie Tyr Asn Thr Val Leu Leu Arg Hia Asp Asn lie Leu Gly Phe lie 
24S 250 255 260 

GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG GAG CTG TGG CTC ATC 1110 
Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gin Leu Trp Leu lie 
265 270 275 

ACG CAC TAC CAC GAG CAC GGC TCC CTC TAC GAC ITT CTG CAG AGA CAG 1158 

Thr His Tyr Hia Glu Hia Gly Ser Leu Tyr Asp Phe Leu Gin Arg Gin 
280 285 290 

ACG CTG GAG CCC CAT CTG GCT CTG AGG CTA GCT GTG TCC GOG GCA TCC 1206 
Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val Ser Ala Ala Cys 
295 300 305 

GGC CTG GCG CAC CTG CAC GTG GAG ATC TTC GGT ACA CAG GGC AAA CCA 1254 
Gly Leu Ala His Leu His Val Glu lie Phe Gly Thr Gin Gly Lys Pro 
310 315 320 

GCC ATT GCC CAC CCC GAC TTC AAG AGC CGC AAT GTG CTG GTC AAG AGC 1302 
Ala lie Ala His Arg Asp Phe Lys Ser Arg Asn Val Leu Val Lys Ser 
325 330 335 340 

AAC CTG CAG TGT TGC ATC GCC GAC CTG GGC CTG GCT GTG ATG CAC TCA 1350 
Asn Leu Gin Cya Cys lie Ala Asp Leu Gly Leu Ala Val Met Hia Ser 
345 350 355 

CAG GGC AGC GAT TAC CTG GAC ATC GGC AAC AAC CCC AGA GTG GGC ACC 1398 
Gin Gly Ser Asp Tyr Leu Asp lie Gly Asn Asn Pro Arg Val Gly Thr 
360 365 370 

AAG CGG TAC ATG GCA CCC GAG GTG CTG GAC GAG CAG ATC CGC ACG GAC 1446 
Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gin lie Arg Thr Asp 
375 380 385 

TGC TTT GAG TCC TAC AAG TGG ACT GAC ATC TGG GCC TTT GGC CTG GTG 1494 
Cya Phe Glu Ser Tyr Lys Trp Thr Asp lie Trp Ala Phe Cly Leu Val 
390 395 400 

CTG TGG GAG ATT GCC CGC CGG ACC ATC GTG AAT GGC ATC GTG GAG GAC 1542 
Leu Trp Glu lie Ala Arg Arg Thr lie Val Asn Gly lie Val Glu Asp 
405 410 415 420 

TAT AGA CCA CCC TTC TAT GAT GTG GTG CCC AAT GAC CCC AGC TTT GAG 1590 
Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp Pro Ser Phe Glu 
425 430 435 

GAC ATG AAG AAG GTG GTG TGT GTG GAT CAG CAG ACC CCC ACC ATC CCT 1638 
Asp Met Lys Lys Val Val Cys Val Asp Gin Gin Thr Pro Thr lie Pro 
440 445 450 

AAC CGG CTG GCT GCA GAC CCG GTC CTC TCA GGC CTA GCT CAG ATG ATG 1686 
Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala Gin Met Met 
455 460 465 
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CGG GAG TGC TGG TAG CCA AAC CCC TCT CCC CGA CTC ACC CCG CTG CGG 1734 
Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg L«u Thr Ala Lau Arg 
470 475 480 

ATC AAG AAG ACA OTA CAA AAA ATT AGC AAC AGT CCA GAG AAG CCT AAA 1782 
lie Lys Lya Thr Leu Gin Lys lie Ser Aan Ser Pro Glu Lys Pro Lye 
485 490 495 500 

GTG ATT CAA TAGCCCAGGA GCACCTGATT CCTTTCTCCC TGCAGGGGCC 1831 
Val lie Gin 

TGGGGGGGTG GGGGGCAGTG GATGGTGCCC TATCTGGGTA GAGGTAGTGT GAGTGTGGTG 1891 

TGTGCTGGGG ATGGGCAGCT GCGCCTGCCT GCTCGGCCCC CAGCCCACCC AGCCAAAAAT 1951 

ACAGCTGGGC TGAAACCTGA AAAAAAAAAA AAA 1984 

(2) INFORMATION FOR SEQ ID MO: 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Leu Gly Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala 
15 10 15 

Leu Val Thr Gin Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val 
20 25 30 

Thr Cys Thr Cys Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly 
35 40 45 

Ala Trp Cys Thr Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gin 
50 55 60 

Glu His Arg Gly Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg 
65 70 75 80 

Pro Thr Glu Phe Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn 
85 90 95 

His Asn Val Ser Leu Val Leu Glu Ala Thr Gin Pro Pro Ser Glu Gin 
100 105 110 

Pro Gly Thr Asp Gly Gin Leu Ala Leu lie Leu Gly Pro Val Leu Ala 
115 120 125 

Leu Leu Ala Leu Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg 
130 135 140 

Arg Arg Gin Glu Lys Gin Arg Gly Leu His Ser Glu Leu Gly Glu Ser 
145 150 155 160 
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Ser Leu lie Leu Ly« Ala Ser Clu Gin Gly Asp Thr Met Leu Gly Aep 
165 170 175 

Leu Leu Asp Ser Aap Cy« Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe 
180 185 190 

Leu Val Gin Arg Thr Val Ala Arg Gin Val Ala Leu Val Glu Cya Val 
195 200 205 

Gly Lye Gly Arg Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu 
210 215 220 

Ser Val Ala Val Lye lie Phe Ser Ser Arg Asp Clu Gin Ser Trp Phe 
225 230 235 240 

Arg Glu Thr Glu He Tyr Aen Thr Val Leu Leu Arg His Asp Aan He 
245 250 255 

Leu Gly Phe He Ala Ser Aap Met Thr Ser Arg Aan Ser Ser Thr Gin 
260 265 270 

Leu Trp Leu He Thr His Tyr Hia Glu Hia Gly Ser Leu Tyr Aap Phe 
275 280 285 

Leu Gin Arg Gin Thr Leu Glu Pro Hia Leu Ala Leu Arg Leu Ala Val 
290 295 300 

Ser Ala Ala Cya Gly Leu Ala Hia Leu Hia Val Glu He Phe Glv Thr 
305 310 315 ^ 320 

Gin Gly Lya Pro Ala He Ala Hia Arg Aap Phe Lya Ser Arg Aan Val 
325 330 335 

Leu Val Lya Ser Aan Leu Gin Cya Cya He Ala Aap Leu Gly Leu Ala 
340 345 350 

Val Met His Ser Gin Gly Ser Aap Tyr Leu Aap He Gly Aan Aan Pro 
355 360 3es 

Arg Val Gly Thr Lya Arg Tyr Met Ala Pro Glu Val Leu Aap Glu Gin 
370 375 380 

He Arg Thr Aap Cya Phe Glu Ser Tyr Lya Trp Thr Aap He Trp Ala 
385 390 395 400 

Phe Gly Leu Val Leu Trp Glu He Ala Arg Arg Thr He Val Aan Gly 
405 410 415 

He Val Glu Aap Tyr Arg Pro Pro Phe Tyr Aap Val Val Pro Aan Aao 
420 425 430 

Pro Ser Phe Glu Aap Met Lya Lya Val Val Cya Val Aap Gin Gin Thr 
435 440 445 

Pro Thr He Pro Aan Arg Leu Ala Ala Aap Pro Val Leu Ser Gly Leu 
450 455 460 

Ala Gin Met Met Arg Glu Cya Trp Tyr Pro Aan Pro Ser Ala Arg Leu 
465 470 475 480 
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Thr Ala Leu Arg lie Lye Lys Thr X<eu Gin Lys lie Ser Aan Ser Pro 
485 490 495 

Glu Lye Pro Lys Val lie Gin 
500 

(2) INFORMATION FOR SBQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2724 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : unknown 
(O) TOPOLOGY: linear 

(ii) MOLECTJLE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 104.. 1630 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCCGAGTAC CCCAGTGACC AGAGTGAGAG AAGCTCTGAA CGAGGGCACG CGGCTTGAAG 60 

GACTGTGGGC AGATGTGACC AAGAGCCTGC ATTAAGTTGT ACA ATG GTA GAT GGA 115 

Met Val Asp Gly 
1 

GTG ATG ATT CTT CCT GTG CTT ATC ATG ATT GCT CTC CCC TCC CCT AGT 163 
Val Met lie Leu Pro Val Leu lie Met lie Ala Leu Pro Ser Pro Ser 
5 10 15 20 

ATG GAA GAT GAG AAG CCC AAG GTC AAC CCC AAA CTC TAC ATG TGT GTG 211 
Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu Tyr Met Cys Val 
25 30 35 

TGT GAA GGT CTC TCC TGC GGT AAT GAG GAC CAC TGT GAA GGC CAG CAG 259 
Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys Glu Gly Gin Gin 
40 45 50 

TGC TTT TCC TCA CTG AGC ATC AAC GAT GGC TTC CAC GTC TAC CAG AAA 307 
Cys Phe Ser Ser Leu Ser lie Asn Asp Gly Phe His Val Tyr Gin Lys 
55 60 65 

GGC TGC TTC CAG GTT TAT GAG CAG GGA AAG ATG ACC TGT AAG ACC CCG 355 
Gly Cys Phe Gin Val Tyr Glu Gin Gly Lys Met Thr Cys Lys Thr Pro 
70 75 80 
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CCG TCC CCT GGC CAA GCT GTC GAG TGC TGC CAA GGG GAC TGG TGT AAC 403 
Pro Ser Pro Gly Gin Ala Val Glu Cys Cya Gin Gly Aap Trp Cya Aan 

85 90 95 100 

AGG AAC ATC ACG GCC GAG CTG CCC ACT AAA GGA AAA TCC TTC CCT GGA 451 
Arg Asn lie Thr Ala Gin Leu Pro Thr Lys Gly Lys ser Phe Pro Gly 
105 110 115 

ACA CAG AAT TTC CAC TTG GAG GTT GGC CTC ATT ATT CTC TCT GTA GTG 499 
Thr Gin Asn Phe Hia Leu Glu Val Gly Leu lie lie Leu Ser Val Val 
120 125 130 

TTC GCA GTA TGT CTT TTA GCC TGC CTG CTG GGA GTT GCT CTC CGA AAA 547 
Phe Ala Val Cya Leu Leu Ala Cys Leu Leu Gly Val Ala Leu Arg Lys 
135 140 145 

TTT AAA AGG CGC AAC CAA GAA CGC CTC AAT CCC CGA GAC GTG GAG TAT 595 
Phe Lys Arg Arg Asn Gin Glu Arg Leu Asn Pro Arg Asp Val Glu Tyr 
ISO 155 160 

GGC ACT ATC GAA GGG CTC ATC ACC ACC AAT GTT GGA GAC AGC ACT TTA 643 
Gly Thr lie Glu Gly Leu lie Thr Thr Asn Val Gly Asp Ser Thr Leu 
165 170 175 180 

GCA GAT TTA TTG CAT CAT TCC TGT ACA TCA GGA AGT GGC TCT GCT CTT 691 
Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser Gly Ser Gly Leu 
185 190 195 

CCT TTT CTG GTA CAA AGA ACA GTG GCT CGC CAC ATT ACA CTG TTG GAG 739 
Pro Phe Leu Val Gin Arg Thr Val Ala Arg Gin lie Thr Leu Leu Glu 
200 205 210 

TGT GTC GGG AAA GGC AGG TAT GCT GAG GTG TGG AGG GGC AGC TGG CAA 787 
Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp Gin 
215 220 225 

GGG GAA AAT GTT GCC GTG AAG ATC TTC TCC TCC CGT GAT GAG AAG TCA 835 
Gly Glu Asn Val Ala Val Lys lie Phe Ser Ser Arg Asp Glu Lys Ser 
230 235 240 

TGG TTC AGG GAA ACG GAA TTG TAC AAC ACT GTG ATG CTG AGG CAT GAA 883 
Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Hot, Leu Arg His Glu 
245 250 255 260 

AAT ATC TTA GGT TTC ATT GCT TCA GAC ATG ACA TCA AGA CAC TCC AGT 931 
Asn lie Leu Gly Phe lie Ala Ser Asp Met Thr Ser Arg His Ser Ser 
265 270 275 

ACC CAG CTG TGG TTA ATT ACA CAT TAT CAT GAA ATG GGA TCG TTG TAC 979 
Thr Gin Leu Trp Leu lie Thr Bis Tyr His Glu Met Gly Ser Leu Tyr 
280 285 290 

GAC TAT CTT CAG CTT ACT ACT CTG GAT ACA GTT AGC TGC CTT CGA ATA 1027 
Asp Tyr Leu Gin Leu Thr Thr Leu Asp Thr Val Ser Cys Leu Arg lie 
295 300 305 

GTG CTG TCC ATA CCT ACT CCT CTT GCA CAT TTG CAC ATA CAG ATA TTT 1075 

Val Leu Ser lie Ala Ser Gly Leu Ala Hia Leu His lie Glu lie Phe 

310 315 320 
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GGC ACC CAA CGG AAA CCA CCC ATT CCC CAT CCA GAT TTA AAC AGO AAA 1123 
Glv Thr Gin Gly Ly« Pro Ala Il« Ala Hi» Arg Asp Leu Lya Sar Lys 
325 330 335 340 

AAT ATT CTC GTT AAC AAG AAT GGA CAG TGT TGC ATA GCA GAT TTC GGC 1171 
Asn lie Leu Val Lya Lya Aon Gly Gin Cya Cye lie Ala Aap Leu Gly 
345 350 355 

CTG GCA GTC ATG CAT TCC CAG AGC ACC AAT CAG CTT GAT GTG GGC AAC 1219 
Leu Ala Val Met His Ser Gin Ser Thr Aan Gin Leu Aap Val Gly Aan 
360 365 370 

AAT CCC CGT GTC GGC ACC AAG CGC TAC ATG CCC CCC GAA GTT CTA GAT 1267 
Aan Pro Arg val Gly Thr Lya Arg Tyr Met Ala Pro Glu Val Leu Aap 
375 380 385 

GAA ACC ATC CAC GTG CAT TGT TTC GAT TCT TAT AAA AGG GTC GAT ATT 1315 
Glu Thr lie Gin Val Aap Cya Pho Aap Ser Tyr Lya Arg Val Aap lie 
390 395 400 

TGC GCC TTT GGA CTT GTT TTC TGC GAA GTG GCC AGO CGG ATG GTG AGC 1363 
Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg Arg Met Val Ser 
405 410 415 420 

AAT GGT ATA CTC CAG CAT TAC AAG CCA CCC TTC TAC GAT GTG GTT CCC . 1411 
Aan Gly lie Val Glu Aap Tyr Lya Pro Pro Phe Tyr Aap Val Val Pro 
425 430 435 

AAT GAC CCA ACT TTT GAA GAT ATC AGG AAG GTA GTC TGT GTG GAT CAA 1459 
Aan Aap Pro Ser Phe Glu Aap Met Arg Lya val Val Cya Val Aap Gin 
440 445 450 

CAA AGG CCA AAC ATA CCC AAC AGA TGC TTC TCA GAC CCC ACA TTA ACC 1507 
Gin Arg Pro Aan lie Pro Aan Arg Trp Phe Ser Aap Pro Thr Leu Thr 
455 460 465 

TCT CTG GCC AAG CTA ATC AAA GAA TGC TGG TAT CAA AAT CCA TCC GCA 1555 
Ser Leu Ala Lya Leu Met Lya Glu Cye Trp Tyr Gin Aan Pro Ser Ala 
470 475 480 

AGA CTC ACA GCA CTC CGT ATC AAA AAG ACT TTC ACC AAA ATT GAT AAT 1603 
Arg Leu Thr Ala Leu Arg lie Lya Lya Thr Leu Thr Lya lie Aop Aan 
485 490 495 500 

TCC CTC GAC AAA TTG AAA ACT CAC TGT TGACATTTTC ATAGTGTCAA 1650 
Ser Leu Aap Lya Leu Lys Thr Aap Cya 
505 



GAAGGAAGAT 


TTCACGTTGT 


TGTCATTCTC 


CAGCTCCGAC 


CTAATCCTGG 


CCTGACTGCT 


1710 


TGTCAGAATG 


GAATCCATCT 


GTCTCCCTCC 


CCAAATGCCT 


GCTTTCACAA 


GGCAGACGTC 


1770 


GTACCCAGCC 


ATGTCTTCCC 


GAGACATCAA 


AACCACCCTA 


ACCTCGCTC6 


ATGACTGTGA 


1830 


ACTGGGCATT 


TCACCAACTC 


TTCACACTGC 


AGACACTAAT 


GTTGGACAGA 


CACTGTTGCA 


1890 


AAGGTAGGGA 


CTGGAGGAAC 


ACAGAGAAAT 


CCTAAAAGAG 


ATCTGGCCAT 


TAACTCAGTG 


1950 


GCTTTCCATA 


GCTTTCACAA 


CTCTCCTAGA 


CACTCCCCAC 


GGCAAACTCA 


AGGAGCTGGT 


2010 
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GAATTTTTAA 

CATTCCTTAC 

GCGTACTCCA 

TGTCAGACTT 

AATTGTTTAT 

AACTGCTTTG 

ATGTTTTTAA 

TTTAAGTGCT 

ACGTATTTAG 

TTTCAGTAGA 

ATTACGTGCA 

TATTTAGTAG 



TCAGCAATAT 

TTGCACTGTT 

CTCCTCTGTC 

TGCTGCATTT 

ACACAACTTT 

TGCATATGTT 

CACTATACTC 

TCACATTTGT 

CCATTACCCA 

ATTTTAGTCC 

TTTAAACTCT 

TTATTTGTAT 



TGCCTGTGCT 

ACTCTTAATT 

TTTCGATAAT 

TACACATCTC 

GCAAATTATT 

AAAGCTTATT 

TAAAATGGAC 

ATGTGTGTAG 

CGTGACACCA 

TGAACGCTAC 

GCCAGAAAAA 

AAATTAAATA 



43 

TCTCTTCTTT 

TTAAAGACCC 

AGGAATTCAA 

CTGATGTTTA 

TATTACTTGT 

TTTATCTGGT 

ATTTTCTTTT 

ACTGTAACTT 

CCGAATATAT 

GGGGAAAATG 

AATAACTATT 

AACTCTTT.TC 
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ATTGCACTAG 

AACTTGCCAA 

TTTGGCAAAA 

CAATGATGCC 

GCACTTAGTA 

CTTATGATTT 

ATTATCAGTT 

TTTTTCAGTT 

TATCGATTTA 

CATTTTCTTC 

TTGTTTTAAT 

AAGTCAAAAA 



GAATTCTTTC 

AATGTTCGCT 

CAAAATGTAA 

GAACATTAGG 

GTTTTTACAA 

TATTACAGAA 

AAAATCACAT 

CATATGCACA 

GAAGCAAAGA 

AGAATTATCC 

CTACTTTTTC 

AAAA 



(2) INFORMATION FOR SEQ ID NOi 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 509 amino acide 
(B> TYPE: amino acid 
(D) TOPOI-OGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val A8P Cly Val Met He Leu Pro Val Leu He Met He Ala Leu 
1 5 lO 

pro ser Pro Ser Met Glu Asp Glu Lye Pro Lye Val Aan Pro Lya Leu 
20 25 

Tyr Met Cys Val Cye Glu Cly Leu Ser Cys Gly Aen Glu Aap Hie Cye 

Glu Gly Gin Gin Cya Phe Ser Ser Leu Ser He Aen Aap Gly Phe Hia 
50 55 60 

val Tyr Gin Lya Gly Cya Phe Gin Val Tyr Glu Gin Gly Lya Met Thr 
65 70 75 

cya Lya Thr Pro Pro Ser Pro Gly Gin Ala Val Glu Cya Cya Gin Gly 

' as 

Aap Trp Cya Aen Arg Aan He Thr Ala Gin Leu Pro Thr Lya Cly Lya 
100 •»-^" 

ser Phe Pro Gly Thr Gin Aan Phe Hia Leu Glu Val Gly Leu He He 

2^15 120 



2070 

2130 

2190 

2250 

2310 

2370 

2430 

2490 

2550 

2610 

2670 

2724 
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'eu ser Val Val Ph« Ala Val Cya Lou Lau Ala Cya Lou Lou Gly Val 

130 135 

Ala Leu Arg Lys Phe Lye Arg Arg Aan Gin Glu Arg Leu Asn Pro Arg 
145 150 15S 16g 

Aao Val Glu Tyr Gly Thr lie Glu Gly Leu lie Thr Thr Aan Val Gly 
165 170 175 

ASP ser Thr Leu Ala Aap Leu Leu Aep His Ser Cys Thr Ser Gly Ser 
180 185 190 

Gly Ser Gly Leu Pro Phe Leu Val Gin Arg Thr Val Ala Arg Gin He 
195 200 205 

Thr Leu Leu Glu Cys Val Gly Lya Gly Arg Tyr Gly Glu Val Trp Arg 
210 215 220 

Glv ser Trp Gin Gly Glu Asn Val Ala Val Lya He Phe ser Ser Arg 
225 230 235 240 

ASP Glu Lys Ser Trp Phe Arg Glu Thr Glu Leu Tyr Aan Thr Val Met 
*^ 245 250 255 

LOU Arg His Glu Asn He Leu Gly Phe He Ala Ser Asp Met Thr Ser 
260 265 270 

Ara His Ser Ser Thr Gin Leu Trp Leu He Thr His Tyr His Glu Met 
^ 275 280 285 

Glv ser Leu Tyr Asp Tyr Leu Gin Leu Thr Thr Leu Aap Thr Val Ser 
290 295 300 

CVS Leu Arg He Val Leu Ser He Ala Ser Gly Leu Ala His Leu His 
305 310 315 320 

He Glu He Phe Gly Thr Gin Gly Lys Pro Ala Ha Ala His Arg Asp 
325 330 335 

Leu Lys Ser Lya Aan He Leu Val Lys Lys Asn Gly Gin Cys Cya Ho 
' 340 345 350 

Ala Asp Leu Gly Leu Ala Val Met His Ser Gin Ser Thr Asn Gin Leu 
355 360 365 

Aap Val Gly Aan Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro 
370 375 380 

Glu Val Leu Asp Glu Thr He Gin Val Asp Cys Phe Aap Ser Tyr Lys 
385 390 395 *00 

Arg val Asp He Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg 
405 410 415 

Ara Met Val Ser Aan Gly Ho Val Glu Asp Tyr Lys Pro Pro Phe Tyr 
^ 425 430 



420 



ASP Val Val Pro Aan Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val 
435 440 445 
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Cya Val Asp Gin Cln Arg Pro Asn lie Pro Aan Arg Trp Ph« Sor Aap 
450 455 460 

Pro Thr Leu Thr Ser Leu Ala Lys Leu Met Lya Glu Cyo Trp Tyr Oln 
465 470 475 480 

Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Thr 
485 490 495 

Lys He Aap Aan Ser Leu Aap Lya Leu Lya Thr Asp Cys 

500 505 

(2) INFORMATION FOR SEQ ID NOt 5: 

(i) SEQOENCB CHARACTERISTICS: 

(A) LENGTHt 2932 baas pairs 

(B) TYPBt nucleic acid 

(C) STRANDBDNESS 1 unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAMB/XEY: CDS 

(B) LOCATION: 310.. 1905 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCTCCGCGCC GAGCCCTGGA CGATGCCTTC CCTGCCCTCC GGACTTATGA AAATATCCAT 60 

CAGTTTAATA CTGTCTTCGA ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTTGGAGAA 120 

AATCAGAAGT ACAG T TTTAT CTAGCCACAT CTTGGAGGAG TCGTAAGAAA GCAGTGGGAG 180 

TTGAAGTCAT TGTCAACTGC TTGCGATCTT TTACAAGAAA ATCTCACTCA ATGATAGTCA 240 

TTTAAATTCG TGAAGTAGCA AGACCAATTA TTAAAGGTGA CAGTACACAC GAAACATTAC 300 

AATTGAACA ATC ACT CAG CTA TAG ATT TAC ATC AGA TTA TTG GGA GCC 348 
Met Thr Gin Leu Tyr He Tyr He Arg Leu Leu Cly Ala 
15 10 

TAT TTG TTC ATC ATT TCT CGT GTT CAA GGA CAG AAT CTC GAT ACT ATO 396 
Tvr Leu Phe He He Ser Arg Val Cln Cly Cln Asn Leu Asp Ser Met 
15 20 25 
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CTT CAT GGC ACT GGC ATG AAA TCA CAC TCC GAC CAG AAA AAC TCA GAA 
Leu His Gly Thr Gly Met Lye Ser Aap Ser Asp Gin Lye Lys Ser Glu 
30 35 40 45 

AAT GGA GTA ACC TTA GCA CCA GAG GAT ACC TTG CCT TTT TTA AAO TCC 
Asn Gly Val Thr Leu Ala Pro Glu Aap Thr Leu Pro Phe Leu Ly« Cy» 
50 55 60 

TAT TGC TCA GGC CAC TGT CCA GAT CAT GCT ATT AAT AAC ACA TGC ATA 
Tyr Cys Ser Gly His Cyo Pro Asp Asp Ala lie Asn Asn Thr Cys lie 
65 70 75 

ACT AAT GGA CAT TGC TTT CCC ATC ATA GAA GAA GAT GAC CAG GGA GAA 

Thr Asn Gly His Cys Phe Ala lie lie Glu Glu Asp Asp Gin Gly Glu 
80 85 90 

ACC ACA TTA GCT TCA GGC TGT ATC AAA TAT GAA GGA TCT GAT TTT CAG 
Thr Thr Leu Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gin 
95 100 lOS 

TGC AAA GAT TCT CCA AAA GCC CAO CTA CGC CGO ACA ATA GAA TCT TGT 
Cvs Lys Asp Ser Pro Lys Ala Gin Leu Arg Arg Thr lie Glu Cys Cys 
110 115 120 125 

CGG ACC AAT TTA TGT AAC CAG TAT TTG CAA CCC ACA CTC CCC CCT GTT 
Arg Thr Asn Leu Cys Asn Gin Tyr Leu Gin Pro Thr Leu Pro Pro Val 
130 135 140 

GTC ATA GGT CCG TTT TTT GAT GGC AGC ATT CCA TGC CTG GTT TTG CTC 
Val lie Gly Pro Phe Phe Asp Gly Ser lie Arg Trp Leu Val Leu Leu 
145 ISO 155 

ATT TCT ATG GCT GTC TGC ATA ATT CCT ATG ATC ATC TTC TCC AGC TGC 
He Ser Met Ala Val Cys He He Ala Met He He Phe Ser Ser Cys 
160 165 170 

TTT TGT TAC AAA CAT TAT TCC AAG AGC ATC TCA AGC AGA CGT CCT TAC 
Phe Cys Tyr Lys His Tyr Cys Lys Ser He Ser Ser Arg Arg Arg Tyr 
175 180 185 

AAT CGT GAT TTG GAA CAG GAT CAA GCA TTT ATT CCA CTT GGA GAA TCA 
Asn Arg Asp Leu Glu Gin Asp Glu Ala Phe He Pro Val Gly Glu Ser 
190 195 200 205 

CTA AAA GAC CTT ATT GAC CAG TCA CAA ACT TCT GGT ACT CGG TCT GGA 
Leu Lys Asp Leu He Asp Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly 
210 215 220 

CTA CCT TTA TTG GTT CAG CCA ACT ATT CCC AAA CAG ATT CAG ATG GTC 
Leu Pro Leu Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val 
225 230 235 

CGG CAA CTT GGT AAA CGC CCA TAT GGA GAA GTA TGC ATG GCC AAA TGG 
Arg Gin Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp 
240 245 250 

CGT GGC GAA AAA GTG GCG GTC AAA GTA TTC TTT ACC ACT GAA GAA CCC 
Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala 
255 260 265 



444 



492 



540 



588 



636 



684 



732 



780 



828 



876 



924 



972 



1020 



1068 



1116 
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AGC TGG TTT CCA GAA ACA GAA ATC TAC CAA ACT GTC CTA ATC CCC CAT 1164 
Ser Trp Phc Arg Glu Thr Glu lie Tyr Gin Thr Val Leu Met Arg His 
270 275 280 285 

GAA AAC ATA CTT GGT TTC ATA GCC GCA CAC ATT AAA GGT ACA GCT TCC 1212 
Glu Asn lie Leu Gly Phe lie Ala Ala Aep He Lys Gly Thr Gly Ser 
290 295 300 

TGG ACT CAC CTC TAT TTG ATT ACT GAT TAC CAT GAA AAT GGA TCT CTC 1260 
Trp Thr Gin Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser Leu 
305 310 315 

TAT GAC TTC CTC AAA TGT GCT ACA CTG GAC ACC AGA CCC CTG CTT AAA 1308 
Tvr Asp Phe Lau Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys 
320 325 330 

TTG GCT TAT TCA GCT GCC TGT GGT CTG TGC CAC CTG CAC ACA GAA ATT 1356 
Leu Ala Tyr Ser Ala Ala Cys Gly Leu Cya His Leu His Thr Glu He 
335 340 345 

TAT CGC ACC CAA GGA AAC CCC CCA ATT GCT CAT CGA CAC CTA AAC AGC 1404 
Tyr Gly Thr Gin Gly Lys Pro Ala He Ala His Arg Asp Leu Lys Ser 
350 355 360' 365 

AAA AAC ATC CTC ATC AAC AAA AAT GGG ACT TGC TGC ATT GCT GAC CTG 1452 
Lys Asn He Leu He Lys Lys Aan Gly Ser Cys Cys He Ala Asp Leu 
370 375 380 

GGC CTT GCT GTT AAA TTC AAC ACT GAC ACA AAT GAA CTT GAT CTG CCC 1500 
Gly Leu Ala Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro 
385 390 395 

TTG AAT ACC ACC CTG GCC ACC AAA CGC TAC ATG OCT CCC GAA GTC CTC 1548 
Leu Aan Thr Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu 
400 405 410 

CAC GAA AGC CTG AAC AAA AAC CAC TTC CAC CCC TAC ATC ATC GCT GAC 1596 
Asp Glu Ser Leu Asn Lys Asn His Phe Gin Pro Tyr He Met Ala Asp 
415 420 425 

ATC TAC AGC TTC GCC CTA ATC ATT TGG GAG ATG GCT CGT CGT TGT ATC 1644 
He Tyr Ser Phe Gly Leu He He Trp Glu Met Ala Arg Arg Cys He 
430 435 440 445 

ACA GGA CGC ATC CTC GAA GAA TAC CAA TTC CCA TAT TAC AAC ATG CTA 1692 
Thr Gly Gly He Val Glu Glu Tyr Gin Leu Pro Tyr Tyr Asn Met Val 
450 455 460 

CCG ACT GAT CCG TCA TAC GAA GAT ATC CGT GAG GTT GTC TGT CTC AAA 1740 
Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cya Val Lya 
465 470 475 

CGT TTG CGC CCA ATT CTC TCT AAT CCG TGG AAC ACT GAT GAA TCT CTA 1788 
Arg Leu Arg Pro He Val Ser Aan Arg Trp Aan Ser Aap Glu Cya Leu 
480 485 490 

CGA GCA GTT TTG AAG CTA ATG TCA GAA TGC TGG GCC CAC AAT CCA GCC 1836 
Arg Ala Val Leu Lys Leu Met Ser Glu Cya Trp Ala His Asn Pro Ala 
495 500 505 
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TCC AGA CTC ACA GCA TTO ACA ATT AAO AAO ACO CTT GCC AAO ATO CTT 1884 
Ser Ara Leu Thr Ala Leu Arg lie Ly« Ly« Thr Leu Ala Lye Met Val 

SlO 515 520 525 

GAA TCC CAA CAT GTA AAA ATC TGATCGTTAA ACCATCGGAG GAGAAACTCT 1935 
Glu Ser Gin Asp Val Lys lie 
530 

AGACTGCAAC AACTGTTTTT ACCCATGGCA TGCGTGCAAT TAGACTGGAA TAACOATGTT 1995 

AACTTGCTTC TCAGACTCTT TCTTCACTAC GTGTTCACAC CCTCCTAATA TTAAACCTTT 2055 

CAGTACTCTT ATTAGGATAC AAGCTGC6AA CTTCTAAACA CTTCATTCTT TATATATGGA 2115 

CAGCTTTATT TTAAATCTCC TTTTTGATGC CTTTTTTTAA GTCCGTTTTT ATGAACTGCA 2175 

TCAAGACTTC AATCCTGATT AGTCTCTCCA GTCAAGCTCT GGGTACTGAA TTCCCTGTTC 2235 

ATAAAACGGT CCTTTCTCTO AAAOCCTTAA GAAGATAAAT GAGCCCAGCA GAGATGGAGA 2295 

AATAGACTTT GCCTTTTACC TGAGACATTC ACTTCGTTTC TATTCTACCT TTGTAAAACA 2355 

GCCTATAGAT CATGATGTGT TTGCGATACT GCTTATTTTA TGATACTTTC TCCTCTCTCC 2415 

TTAGTGATGT CTCTCTGTCT CCATGCACAT CCACOCCGGO ATTCCTCTGC TGCCATTTGA 2475 

ATTAGAAGAA AATAATTTAT ATGCATGCAC AGCAAGATAT TGCTGGCCCC TGCTTTTCTC 2535 

CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT TTACTTTCCA 2595 

ACTGAGATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC CAAGGCCAAA 2655 

AGAAGTTTAA AGCATCTCTA AATTTGGACT GTTTTCCTTC AACCACCATT TTTTTTCTGG 2715 

TTATTATTTT TGTCACCGAA AGCATCCTCT CCAAA6TTGG AGCTTCTATT CCCAT6AACC 2775 

ATCCTTACAA AGAAAGCACT TCTTATTCAA GTCAATTCCT GCATTTGATA GCAATGTAAG 2835 

TGCCTATAAC CATGTTCTAT ATTCTTTATT CTCAGTAACT TTTAAAAGCC AACTTATTTA 2895 

TATTTTGTGT ATAATGTGCT TTATTTGCAA ATCACCC 2932 

(2) INFORMATION FOR SEQ ID NO: 6: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Thr Gin Leu Tyr lie Tyr lie Arg Leu Leu Cly Ala Tyr Leu Phe 
1 5 10 15 

lie lie ser Arg Val Gin Gly Gin Aen Leu Asp Ser Met Leu Hie Cly 
20 25 30 
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Gin Gly Lys Pre Ala lie Ala Hia Arg Aap Leu Lys Ser Lys Asn lie 

355 360 365 

Leu lie Lys Lya Asn Gly Ser Cys Cya lie Ala Aap Leu Gly Leu Ala 

370 375 380 

Val Lys Phe Aen Ser Asp Thr Asn Glu Val Asp Val Pro Leu Asn Thr 
385 390 395 400 

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser 
405 410 415 

Leu Asn Lys Asn His Phe Gin Pro Tyr lie Met Ala Asp lie Tyr Ser 
420 425 430 

Phe Gly Leu lie lie Trp Glu Met Ala Arg Arg Cys lie Thr Gly Gly 
435 440 445 

lie Val Glu Glu Tyr Gin Leu Pro Tyr Tyr Asn Mot Val Pro Ser Asp 
450 455 460 

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg 
465 470 475 480 

Pro lie Val Ser Asn Arg Trp Asn ser Asp Glu Cys Leu Arg Ala Val 
485 490 495 

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu 
500 505 510 

Thr Ala Leu Arg lie Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gin 
515 520 525 

Asp Val Lys lie 
530 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 

(ill) HYPOTHETICAL: NO 

(Hi) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Homo sapiens 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1515 
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Thr Gly Met Lys Ser Asp Ser Asp Gin Lys Lys Sar Glu Asn cly Val 
35 40 45 

Thr Leu Ala Pro Glu Aap Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser 
50 55 60 

Gly His eye Pro Asp Asp Ala He Asn Asn Thr Cys He Thr Asn Gly 
65 70 75 80 

His Cys Phe Ala He He Glu Glu Asp Asp Gin Gly Glu Thr Thr Leu 
85 90 95 

Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gin Cys Lys Asp 
100 105 110 

Ser Pro Lys Ala Gin Leu Arg Arg Thr Ha Glu Cys Cys Arg Thr Asn 
115 120 125 

Leu Cys Asn Gin Tyr Leu Gin Pro Thr Leu Pro Pro Val Val He Gly 
130 135 140 

Pro Phe Phe Asp Gly Ser He Arg Trp Leu Val Leu Leu He Ser Met 
145 150 155 160 

Ala Val Cys He He Ala Met He He Phe Ser Ser Cys Phe Cys Tyr 
165 170 175 

Lys His Tyr Cys Lys Ser He Ser Ser Arg Arg Arg Tyr Asn Arg Asp 
180 185 190 

Leu Glu Gin Asp Glu Ala Phe He Pro Val Gly Glu Ser Leu Lys Asp 
195 200 205 

Leu He Asp Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu 
210 215 220 

Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val Arg Gin Val 
225 230 235 240 

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu 
245 250 255 

Lya Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe 
260 265 270 

Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg His Glu Asn He 
275 280 285 

Leu Gly Phe He Ala Ala Asp He Lys Gly Thr Gly Ser Trp Thr Gin 
290 295 300 

Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe 
305 310 315 320 

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr 
325 330 335 

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu He Tyr Gly Thr 
340 345 350 
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(xi) SSQUENCS OZSCRIPTIONi SBQ ID NOt 7: 

ATC GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT CTT GTC CTC 48 
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
15 10 15 

CTG CTC GCC GGC AGC GCC GGC TCC GGG CCC CCG GGG GTC CAG GCT CTG 96 
Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly V&l Gin Ala Leu 
20 25 30 

CTC TGT GCC TGC ACC AGC TGC CTC CAG GCC AAC TAC ACG TCT GAG ACA 144 
Leu Cys Ala Cya Thr Ser Cys Leu Gin Ala Asn Tyr Thr Cys Glu Thr 
35 40 45 

GAT GGG CCC TGC ATG GTT TCC TTT TTC AAT CTG GAT GGG ATG GAG CAC 192 
Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Het. Glu His 
50 55 60 

CAT GTG CGC ACC TGC ATC CCC AAA GTG GAG CTG GTC CCT GCC GGG AAG 240 
His Val Arg Thr Cys lie Pro Lys Val Glu Leu Val Pro Ala Gly Lys 
65 70 75 80 

CCC TTC TAC TGC CTG AGC TCG GAG GAC CTG CGC AAC ACC CAC TGC TGC 288 
Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys cys 

85 90 95 

TAC ACT GAC TAC TGC AAC AGG ATC GAC TTG AGG GTG CCC AGT GGT CAC 336 
Tyr Thr Asp Tyr Cvs Asn Arg lie Asp Leu Arg Val Pro Ser Gly His 
100 * 105 110 

CTC AAG GAG CCT GAG CAC CCG TCC ATG TGG GGC CCG GTG GAG CTG GTA 384 
Leu Lys Glu Pro Glu His Pro Ser Met: Trp Gly Pro Val Glu Leu Val 
115 120 125 

GGC ATC ATC GCC GGC CCG GTG TTC CTC CTG TTC CTC ATC ATC ATC ATT 432 
Gly lie lie Ala Gly Pro Val Phe Leu Leu Phe Leu lie lie lie lie 
130 135 140 

CTT TTC CTT GTC ATT AAC TAT CAT CAG CGT GTC TAT CAC AAC CGC CAC 480 
Val Phe Leu Val lie Asn Tyr His Gin Arg Val Tyr His Asn Arg Gin 
145 ISO 155 160 

AGA CTG GAC ATG GAA GAT CCC TCA TGT GAG ATG TGT CTC TCC AAA GAC 528 
Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lya Asp 
165 170 175 

AAG ACG CTC CAG GAT CTT GTC TAC GAT CTC TCC ACC TCA GGG TCT GGC 576 
Lys Thr Leu Gin Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly 
180 185 190 

TCA GGG TTA CCC CTC TTT GTC CAG CGC ACA GTG GCC CGA ACC ATC GTT 624 
Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val Ala Arg Thr lie Val 
195 200 205 

TTA CAA GAG ATT ATT GGC AAG GGT CGG TTT GCC GAA GTA TCG CGG GGC 672 
Leu Gin Glu lie lie Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly 
210 215 220 
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CGC TGG AGG GGT GOT GAT GTG GOT GTG AAA ATA TTC TCT TCT CGT CAA 720 
Arg Trp Arg Gly Gly Asp Val Ala Val Lya 11* Pha Ser Ser Arg Glu 
225 230 235 240 

GAA CGG TCT TGG TTC AGG GAA OCA GAG ATA TAC CAG ACG GTC ATG CTG 768 
Glu Arg Ser Trp Phe Arg Glu Ala Glu lie Tvr Gin Thr Val Met Leu 
245 250 255 

CGC CAT GAA AAC ATC CTT GGA TTT ATT GCT SCT GAC AAT AAA GAT AAT 816 
Arg His Glu Asn He Leu Gly Phe He Ala Ala Asp Asn Lya Asp Asn 
260 265 270 

GGC ACC TGG ACA CAG CTG TGG CTT GTT TCT GAC TAT CAT GAG CAC GGG 864 
Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Asp Tyr His Glu His Gly 
275 280 285 

TCC CTG TTT GAT TAT CTG AAC CGG TAC ACA GTG ACA ATT GAG GGG ATG 912 
Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr lie Glu Gly Met 
290 295 300 

ATT AAG CTG GCC TTG TCT GCT GCT AGT GGG CTG GCA CAC CTG CAC ATG 960 
He Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met 
305 310 315 320 

GAC ATC CTG GCC ACC CAA GGO AAG CCT CCA ATT GCT CAT CCA GAC TTA 1008 
Glu He Val Gly Thr Gin Gly Lys Pro Gly He Ala His Arg Asp Leu 
325 330 335 

AAG TCA AAG AAC ATT CTG GTG AAG AAA AAT GGC ATG TGT GCC ATA GCA 1056 
Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly Met Cys Ala He Ala 
340 345 350 

GAC CTG GGC CTG GCT GTC CGT CAT GAT GCA GTC ACT GAC ACC ATT GAC 1104 
Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr He Asp 
355 360 365 

ATT GCC CCG AAT CAG AGG GTG GGG ACC AAA CGA TAC ATG GCC CCT GAA 1152 
He Ala Pro Asn Gin Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu 
370 375 380 

GTA CTT GAT GAA ACC ATT AAT ATG AAA CAC TTT GAC TCC TTT AAA TGT 1200 
Val Leu Asp Glu Thr He Asn Met Lys His Phe Asp Ser Phe Lys Cys 
385 390 395 400 

GCT GAT ATT TAT GCC CTC CGG CTT GTA TAT TGG GAG ATT GCT CGA AGA 1248 
Ala Asp He Tyr Ala Leu Gly Leu Val Tyr Trp Glu He Ala Arg Arg 
405 410 415 

TGC AAT TCT GGA GGA GTC CAT GAA GAA TAT CAG CTG CCA TAT TAC GAC 1296 
Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gin Leu Pro Tyr Tyr Asp 
420 425 430 

TTA GTG CCC TCT GAC CCT TCC ATT GAG GAA ATG CGA AAG GTT GTA TGT 1344 
Leu Val Pro Ser Asp Pro Ser He Glu Glu Met Arg Lys Val Val Cys 
435 440 445 

GAT CAG AAG CTG CGT CCC AAC ATC CCC AAC TGG TGG CAG AGT TAT GAG 1392 
Asp Gin Lys Leu Arg Pro Asn He Pro Asn Trp Trp Gin Ser Tyr Glu 
450 455 460 
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GCA CTC CGG GTG ATG GGC AAG ATG ATG CGA GAG TGT TGG TAT GCC AAC 1440 
Ala Leu Arg Val Met Gly Ly» Mot Mot Arg Glu Cyo Trp Tyr Ala Aan 
465 470 475 480 

GGC GCA GCC CGC CTC ACG GCC CTG CGC ATC AAG AAG ACC CTC TCC CAG 1488 
Gly Ala Ala Arg Leu Thr Ala Leu Arg lie Lys Lys Thr Leu Ser Gin 
485 490 495 



CTC AGC GTG CAG GAA GAC GTG AAG ATC TAACTGCTCC CTCTCTCCAC 1535 
Leu Ser Val Gin Glu Asp Val Lys lie 
500 505 



ACGGAGCTCC 


TGGCAGCGAG 


AACTACGCAC 


AGCTGCCGCG 


TTGAGCGTAC 


GATGGAGCCC 


1595 


TACCTCTCGT 


TTCTGCCCAG 


CCCTCTGTGG 


CCAGGAGCCC 


TGGCCCGCAA 


GAGGGACAGA 


1655 


GCCCGGGA6A 


6ACTCGCTCA 


CTCCCATGTT 


GGGTTTGAGA 


CAGACACCTT 


TTCTATTTAC 


1715 


CTCCTAATGG 


CATGGAGACT 


CTGAGAGCGA 


ATTGTGTGGA 


GAACTCAGTG 


CCACACCTCG 


1775 


AACTGGTTGT 


AGTGGGAACT 


CCCGCGAAAC 


CCGGTGCATC 


TGGCACGTGG 


CCAGGAGCCA 


1835 


TGACAGGGGC 


GCTTGGGAGG 


GGCCGGAGGA 


ACCCAGGTGT 


TGCCAGTGCT 


AAGCTGCCCT 


1895 


GAGGGTTTCC 


TTCGGGGACC 


AGCCCACAGC 


ACACCAAGGT 


GGCCCGGAAG 


AACCAGAAGT 


1955 


GCAGCCCCTC 


TCACAGGCAG 


CTCTGAGCCG 


CGCTTTCCCC 


TCCTCCCTGG 


GATGGACGCT 


2015 


GCCGGGAGAC 


TGCCAGTGGA 


GACGGAATCT 


GCCGCTTTGT 


CTGTCCAGCC 


GTGTGTGCAT 


2075 


GTGCCGAGGT 


GCGTCCCCCG 


TTGTGCCTGG 


TTCGTCCCAT 


GCCCTTACAC 


GTGCGTGTGA 


2135 


GTGTCTGTGT 


CTGTCTGTAG 


GTCCCCACTT 


ACCTCCTTGA 


GCTTTCTGTG 


CATCTGCAGG 


2195 


TCGGGGGTGT 


GCTCGTCATG 


CTGTCCGTGC 


TTGCTGGTGC 


CTCTTTTCAG 


TAGTGAGCAG 


2255 


CATCTAGTTT 


CCCTGCTGCC 


CTTCCCTGGA 


GGTCTCTCCC 


TCCCCCAGAG 


CCCCTCATGC 


2315 


CACAGTGGTA 


CTCTGTGT 










2333 



(2) INFORMATION FOR SEQ ID NOt 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGr: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NOt 8: 

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
15 10 IS 

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gin Ala Leu 
20 25 30 
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Leu Cya Ala Cya Thr Ser Cys Leu Gin Ala Aan Tyr Thr Cys Glu Thr 
35 40 45 

Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Mat Glu Hia 

50 55 60 

Hia Val Arg Thr Cya lie Pro Lya Val Glu Leu Val Pro Ala Gly Lya 
65 70 75 80 

Pro Phe Tyr Cya Leu Ser Ser Glu Aap Leu Arg Aan Thr Hia Cya Cya 
85 90 95 

Tyr Thr Aap Tyr Cya Aan Arg lie Aap Leu Arg Val Pro Ser Gly Hia 
100 105 110 

Leu Lya Glu Pro Glu His Pro Ser Met Trp Gly Pro Val Glu Leu Val 
115 120 125 

Gly lie lie Ala Gly Pro Val Phe Leu Leu Phe Leu lie lie lie lie 
130 135 140 

Val Phe Leu Val lie Asn Tyr Hia Gin Arg Val Tyr Hia Aan Arg Gin 
145 150 155 160 

Arg Leu Aap Met Glu Aap Pro Ser Cya Glu Met Cya Leu Ser Lya Aap 
165 170 175 

Lya Thr Leu Gin Aap Leu Val Tyr Aap Leu Ser Thr Ser Gly Ser Gly 
180 185 190 

Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val Ala Arg Thr lie Val 
195 200 205 

Leu Gin Glu lie lie Gly Lya Gly Arg Phe Gly Glu Val Trp Arg Gly 
210 215 220 

Arg Trp Arg Gly Gly Aap Val Ala Val Lya lie Phe Ser Ser Arg Glu 
225 230 235 240 

Glu Arg Ser Trp Phe Arg Glu Ala Glu lie Tyr Gin Thr Val Met Leu 
245 250 255 

Arg Hia Glu Aan lie Leu Gly Phe lie Ala Ala Aap Aan Lya Aap Aan 
260 265 270 

Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Aap Tyr Hia Glu Hia Gly 
275 280 285 

Ser Leu Phe Aap Tyr Leu Asn Arg Tyr Thr Val Thr lie Glu Gly Met 
290 295 300 

lie Lya Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala Hia Leu Hia Met 

305 310 315 320 

Glu lie Val Gly Thr Gin Gly Lya Pro Gly Zla Ala Hia Arg Aap Leu 
325 330 335 

Lya Ser Lya Aan lie Leu Val Lya Lya Aan Gly Met Cys Ala He Ala 
340 345 350 
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Asp Leu Cly Leu Ala Val Arg His Aep Ala Val Thr Asp Thr lie Aap 

355 360 365 

lie Ala Pro Asn Gin Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu 
370 375 380 

Val Leu Asp Glu Thr He Asn Met Lys His Pha Asp Ser Phe Lys Cys 
385 390 395 400 

Ala Asp He Tyr Ala Leu Gly Leu Val Tyr Trp Glu He Ala Arg Arg 
405 410 415 

Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gin Leu Pro Tyr Tyr Asp 
420 425 430 

Leu Val Pro Ser Asp Pro Ser He Glu Glu Met Arg Lys Val Val Cys 
435 440 445 

Asp Gin Lys Leu Arg Pro Asn He Pro Asn Trp Trp Gin Ser Tyr Glu 
450 455 460 

Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn 
465 470 475 480 

Gly Ala Ala Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Ser Gin 
485 490 495 

Leu Ser Val Gin Glu Asp Val Lys He 
500 505 

(2) INFORMATION FOR SEQ 10 NO* 9: 

<i) SEQX7ENCE CHARACTERISTICS ( 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 77.. 1585 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGCGAGGCGA GGTTTGCTGG GGTGAGGCAG CGGCGCGGCC GGGCCGGGCC GGGCCACAGG 60 
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CGGTGCCGGC GGGACC ATC GAG CCG CCC GTC GCT GCT CCC CGT CCC CGC 109 

Mai: Glu him. Ala Val Ala Ala Pro Arg Pro Arg 
IS 10 

CTG CTC CTC CTC GTG CTC GCG GCG CCG GCG GCG GCG CCC CCC GCG CTG 157 
Leu Leu Leu Leu Val Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu 
15 20 25 

CTC CCC GCG GCG ACC CCG TTA CAC TGT TTC TGC CAC CTC TGT ACA AAA 205 
Leu Pro Gly Ala Thr Ala Leu Gin Cy« Phe Cy« His Leu Cy« Thr Lyo 
30 35 40 

GAC AAT TTT ACT TGT GTG ACA GAT CCG CTC TGC TTT GTC TCT GTC ACA 253 
Aap Asn Phe Thr Cye Val Thr Asp Gly Leu Cya Phe Val Ser Val Thr 
45 50 55 

GAG ACC ACA GAC AAA GTT ATA CAC AAC ACC ATC TGT ATA GCT CAA ATT 301 
Glu Thr Thr Asp Lya Val lie Hia Asn Ser Mete Cya lie Ala Glu lie 
60 65 70 75 

GAC TTA ATT CCT CCA CAT AGG CCG TTT CTA TGT CCA CCC TCT TCA AAA 349 
Asp Leu lie Pro Arg Asp Arg Pro Phe Val Cya Ala Pro Ser Ser Lya 
80 85 90 

ACT CCC TCT GTG ACT ACA ACA TAT TCC TGC AAT CAC GAC CAT TGC AAT 397 
Thr Gly Ser Val Thr Thr Thr Tyr Cya Cya Aan Gin Aap Hia Cya Asn 
95 100 105 

AAA ATA GAA CTT CCA ACT ACT GTA AAC TCA TCA CCT CCC CTT GOT CCT 445 

Lya lie Glu Leu Pro Thr Thr Val Lya Ser Ser Pro Gly Leu Gly Pro 
110 115 120 

GTC GAA CTC GCA GCT CTC ATT GCT GCA CCA GTC TCC TTC GTC TGC ATC 493 
Val Glu Leu Ala Ala Val lie Ala Gly Pro Val Cya Phe Val Cya lie 
125 130 135 

TCA CTC ATC TTC ATG GTC TAT ATC TCC CAC AAC CCC ACT GTC ATT CAC 541 

Ser Leu Met Leu Met Val Tyr lie Cya His Aan Arg Thr Val lie His 
140 145 150 155 

CAT CGA GTC CCA AAT GAA GAG GAC CCT TCA TTA GAT CCC CCT TTT ATT 589 
Hia Arg Val Pro Aan Glu Glu Aap Pro Ser Leu Asp Arg Pro Phe lie 
160 165 170 

TCA GAG GGT ACT ACC TTC AAA CAC TTA ATT TAT GAT ATG ACA ACC TCA 637 
Ser Glu Gly Thr Thr Leu Lys Asp Leu lie Tyr Asp Met Thr Thr Ser 
175 180 185 

GGT TCT GGC TCA GGT TTA CCA TTC CTT CTT CAC ACA ACA ATT GCG ACA 685 
Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gin Arg Thr lie Ala Arg 
190 195 200 

ACT ATT GTC TTA CAA GAA AGC ATT GGC AAA GGT CCA TTT CGA GAA GTT 733 
Thr lie Val Leu Gin Glu Ser lie Gly Lya Gly Arg Phe Gly Glu Val 
205 210 215 

TGC ACA GGA AAG TGC CGC CGA GAA GAA GTT CCT CTT AAG ATA TTC TCC 781 
Trp Arg Gly Lya Trp Arg Gly Glu Glu Val Ala Val Lys lie Phe Ser 
220 225 230 235 
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TCT AGA GAA GAA CGT TCG TGG TTC COT GAG GCA GAG ATT TAT CaiA ACT 
Ser Arc Glu Glu Arg Ser Trp Pho Arg Glu Ala Clu I la Tyr Gin Thr 
240 245 250 

GTA ATG TTA CGT CAT GAA AAC ATC CTG GGA TTT ATA CCA GCA GAC AAT 
Val Met Leu Arg His Glu Asn lie Leu Gly Phe lie Ala Ala Asp Asn 
255 260 265 

AAA GAC AAT GGT ACT TGG ACT CAC CTC TGG TTG CTG TCA GAT TAT CAT 
Lya Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Asp Tyr His 
270 275 280 



829 



877 



925 



GAG CAT GGA TCC CTT TTT GAT TAC TTA AAC AGA TAC ACA GTT ACT GTG 973 
Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val 
285 290 295 

GAA GGA ATG ATA AAA CTT GCT CTC TCC ACG GCC AGC GGT CTT GCC CAT 1021 
Glu Gly Met lie Lys Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His 
300 305 310 315 

CTT CAC ATG GAG ATT GTT GGT ACC CAA CCA AAC CCA GCC ATT GCT CAT 1069 
Leu His Met Glu lie Val Gly Thr Gin Gly Lys Pro Ala lie Ala His 
320 325 330 

ACA GAT TTG AAA TCA AAG AAT ATC TTG GTA AAG AAG AAT GGA ACT TCC 1117 
Arg Asp Leu Lys Ser Lys Asn lie Leu Val Lys Lys Asn Gly Thr Cys 
335 340 345 



TGT ATT GCA GAC TTA GGA CTG GCA GTA AGA CAT GAT TCA GCC ACA GAT 1165 
Cvs lie Ala Asp Leu Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp 
350 355 360 

ACC ATT GAT ATT GCT CCA AAC CAC AGA GTG GGA ACA AAA AGO TAC ATC 1213 
Thr lie Asp lie Ala Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met 
365 370 375 

GCC CCT GAA GTT CTC GAT GAT TCC ATA AAT ATG AAA CAT TTT GAA TCC 1261 
Ala Pro Glu Val Leu Asp Asp Ser lie Asn Met Lys His Phe Glu Ser 
380 385 390 395 



TTC AAA CGT GCT GAC ATC TAT GCA ATG GGC TTA GTA TTC TGG GAA ATT 
Phe Lys Arg Ala Asp lie Tyr Ala Met Gly Leu Val Phe Trp Glu lie 
400 405 410 



1309 



GCT CGA CGA TGT TCC ATT GGT GGA ATT CAT GAA GAT TAC CAA CTG CCT 
Ala Arg Arg Cys Ser lie Gly Gly lie His Glu Asp Tyr Gin Leu Pro 
415 420 425 



1357 



TAT TAT GAT CTT GTA CCT TCT GAC CCA TCA GTT GAA GAA ATC ACA AAA 1405 
Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Val Glu Glu Met Arg Lys 
430 435 440 

GTT GTT TCT GAA CAG AAG TTA ACG CCA AAT ATC CCA AAC AGA TGG CAC 1453 
Val Val Cys Glu Gin Lys Leu Arg Pro Asn lie Pro Asn Arg Trp Gin 
445 450 455 

ACC TCT CAA GCC TTC ACA GTA ATC CCT AAA ATT ATC ACA CAA TCT TGG 1501 
Ser Cvs Glu Ala Leu Arg Val Met Ala Lys lie Met Arg Clu Cys Trp 
460 " 465 470 475 
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TAT CCC AAT GCA GCA OCT AGG CTT ACA OCA TTG CGC ATT AAC AAA ACA 1549 
Tvr Ala Asn Gly Ala Ala Arg Lou Thr Ala Lou Arg He Lya Lya Thr 
480 485 490 



TTA TCG CAA CTC ACT CAA CAG GAA GGC ATC AAA ATG TAATTCTACA 1595 
Leu Ser Gin Leu Ser Gin Gin Glu Gly He Lya Met 
495 500 



GCTTTGCCTG 


AACTCTCCTT 


TTTTCTTCAG 


ATCTGCTCCT 


GGGTTTTAAT 


TTGGGAGGTC 


1655 


AGTTGTTCTA 


CCTCACTGAC 


AGGGAACAGA 


AGGATATTGC 


TTCCTTTTGC 


AGCAGTGTAA 


1715 


TAAAGTCAAT 


TAAAAACTTC 


CCAGGATTTC 


TTTGCACCCA 


GGAAACAGCC 


ATCTGGGTCC 


1775 


TTTCTGTGCA 


CTATGAACGC 


TTCTTTCCCA 


GGACAGAAAA 


TGTGTAGTCT 


ACCTTTATTT 


1835 


TTTATTAACA 


AAACTTGTTT 


TTTAAAAAGA 


TGATTGCTGG 


TCTTAACTTT 


AGGTAACTCT 


1895 


GCTGTGCTGG 


AGATCATCTT 


TAAGGGCAAA 


GGAGTTGGAT 


TGCTGAATTA 


CAATGAAACA 


1955 


TGTCTTATTA 


CTAAAGAAAG 


TGATTTACTC 


CTGGTTAGTA 


CATTCTCACA 


GCATTCTGAA 


2015 


CCACTAGAGT 


TTCCTTGATT 


CAGACTTTGA 


ATGTACTGTT 


CTATAGTTTT 


TCAGGATCTT 


2075 


AAAACTAACA 


CTTATAAAAC 


TCTTATCTTG 


AGTCTAAAAA 


TGACCTCATA 


TAGTAGTGAG 


2135 


GAACATAATT 


CATGCAATTG 


TATTTTGTAT 


ACTATTATTC 


TTCTTTCACT 


TATTCAGAAC 


2195 


ATTACATGCC 


TTCAAAATGG 


GATTGTACTA 


TACCAGTAAG 


TGCCACTTCT 


GTGTCTTTCT 


2255 


AATGGAAATG 


AGTAGAATTG 


CTGAAAGTCT 


CTATGTTAAA 


ACCTATAGTG 


TTT 


2308 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Glu Ala Ala Val Ala Ala Pro Arg Pro Arg Leu Leu Leu Leu Val 
15 10 15 

Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu Leu Pro Gly Ala Thr 
20 25 30 

Ala Leu Gin Cya Phe Cys His Leu Cys Thr Lys Asp Aen Phe Thr Cys 
35 40 45 

Val Thr Asp Gly Leu Cys Phe Val Ser Val Thr Glu Thr Thr Asp Lys 
50 55 60 

Val He His Asn Ser Met Cys He Ala Glu He Asp Leu He Pro Arg 
65 70 75 80 
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Asp Arg Pro Phe Val Cys Ala Pro Sor Sor Lya Thr Gly Ser Val Thr 
as 90 95 

Thr Thr Tyr Cya Cys Asn Gin Asp His Cya Aan Lya lie Glu Leu Pro 
100 105 110 

Thr Thr Val Lya Ser Ser Pro Gly Leu Gly Pro Val Glu Leu Ala Ala 
115 120 125 

Val lie Ala Gly Pro Val Cys Pho Val Cya lie Ser Leu Met Leu Met 
130 135 140 

Val Tyr lie Cya Hia Asn Arg Thr Val lie Hia Hia Arg Val Pro Aan 
145 150 155 160 

Glu Glu Asp Pro Ser Leu Asp Arg Pro Phe lie Ser Glu Gly Thr Thr 
165 170 175 

Leu Lya Aep Leu lie Tyr Aap Met Thr Thr Ser Gly Ser Gly Ser Gly 
180 185 190 

Leu Pro Leu Leu Val Gin Arg Thr lie Ala Arg Thr lie Val Leu Gin 
195 200 205 

Glu Ser lie Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly Lya Trp 
210 215 220 

Arg Gly Glu Glu Val Ala Val Lya lie Phe Ser Ser Arg Glu Glu Arg 
225 230 235 240 

Ser Trp Phe Arg Glu Ala Glu He T^'r Gin Thr Val Met Leu Arg Hia 
245 250 255 

Glu Asn He Leu Gly Phe lie Ala Ala Aap Aan Lya Asp Aan Gly Thr 
260 265 270 

Trp Thr Gin Leu Trp Leu Val Ser Aap Tyr Hia Glu Hia Gly Ser Leu 
275 280 285 

Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val Glu Gly Met He Lys 
290 295 300 

Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala Hia Leu His Met Glu He 
305 310 315 320 

Val Gly Thr Gin Gly Lys Pro Ala Ha Ala Hia Arg Asp Leu Lya Ser 
325 330 335 

Lys Asn He Leu Val Lys Lya Asn Gly Thr Cya Cys He Ala Asp Leu 
340 345 350 

Gly Leu Ala Val Arg Hia Aap Ser Ala Thr Asp Thr He Asp He Ala 
355 360 365 

Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu 
370 375 380 

Asp Asp Ser He Asn Met Lys His Phe Glu Ser Phe Lys Arg Ala Asp 



385 



390 



395 



400 
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He Tyr Ala Met Cly Lou Val Phe Trp Glu He Ala Arg Arg Cya Ser 
405 410 415 

lie Gly Gly He His Glu Asp Tyr Gin Leu Pro Tyr Tyr Asp Leu Val 
420 425 430 

Pro ser Asp Pro Ser Val Glu Glu Met Arg Lya Val Val Cya Glu Gin 
435 440 445 

Lya Leu Arg Pro Asn He Pro Aon Arg Trp Gin Ser Cya Glu Ala Leu 
450 455 460 

Ara Val Met Ala Lya He Met Arg Glu Cya Trp Tyr Ala Aan Gly Ala 
465 470 475 480 

Ala Arg Leu Thr Ala Leu Arg He Lye Lya Thr Leu Ser Gin Leu Ser 
485 490 495 

Gin Gin Glu Gly He Lya Met 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: unknown 
(0) TOPOLOGY: linear 

(ti) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(ill) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 241.. 1746 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAGAGCACAG CCCTTCCCAG TCCCCGGAGC CGCCGCCCCA CGCCCGCATG ATCAAGACCT 

TTTCCCCGGC CCCACAGGGC CTCTGGACGT GAGACCCCGG CCGCCTCCGC AAGGAGACGC 

GGGGGTCGAO TCGCCCTGTC CAAAGGCCTC AATCTAAACA ATCTTGATTC CTCTTGCCGC 

CTGGCGGGAC CCTGAATGGC ACGAAATCTC ACCACATCTC TTCTCCTATC TCCAAGGACC 

ATG ACC TTG GGG AGC TTC AGA AGG GGC CTT TTG ATG CTG TCG GTG GCC 
Met Thr Leu Gly Ser Phe Arg Arg Gly Leu Leu Met Leu Ser Val Ala 



60 
120 
180 
240 
288 



500 



1 



5 
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TTG GGC CTA ACC CAG CGG AGA CTT GCC AAG CCT TCC AAG CTC CTG AAC 336 
Leu Gly Leu Thr GXn Gly Arg Leu Ala Lys Pro Ser Lya Leu Val Aan 
20 25 30 

TGC ACT TGT GAG AGC CCA CAC TGC AAG AGA CCA TTC TCC CAG GGC TCA 384 
Cys Thr cy« Glu Ser Pro Hi* Cye Lya Arg Pro Phe Cya Gin Gly Ser 
35 40 45 

TGG TGC ACA GTG GTG CTG GTT CCA GAG CAG GGC AGG CAC CCC CAG GTC 432 
Trp Cya Thr Val Val Leu Val Arg Glu Gin Gly Arg Hia Pro Gin Val 
50 55 60 

TAT CGG GGC TGT GGG AGC CTG AAC CAG GAG CTC TGC TTG GGA CGT CCC 480 
Tvr Arg Gly Cya Gly Ser Leu Aan Gin Glu Leu Cya Leu Gly Arg Pro 
65 70 75 80 

ACG GAG TTT CTG AAC CAT CAC TGC TGC TAT AGA TCC TTC TGC AAC CAC 528 
Thr Glu Phe Leu Aan Hia Hio Cya Cya Tyr Arg Ser Phe Cya Aan His 
85 90 95 

AAC GTC TCT CTC ATG CTG GAG GCC ACC CAA ACT CCT TCG GAG CAG CCA 576 
Aan Val Ser Leu Met Leu Glu Ala Thr Gin Thr Pro Ser Glu Glu Pro 
100 105 110 

GAA GTT GAT GCC CAT CTG CCT CTG ATC CTG GGT CCT GTG CTC GCC TTG 624 
Glu Val Aap Ala Hia Leu Pro Leu lie Leu Gly Pro Val Leu Ala Leu 
115 120 125 

CCG GTC CTG GTC CCC CTC GGT GCT CTG GGC TTG TGG CGT GTC CCC CGG 672 
Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg 
130 135 140 

AGG CAG GAG AAG CAG CGG GAT TTC CAC ACT GAC CTG GGC GAG TCC ACT 720 

Arg Gin Glu Lya Gin Arg Asp Leu Hie Ser Aap Leu Gly Glu Ser Ser 
145 ISO 155 160 

CTC ATC CTG AAG GCA TCT GAA CAG GCA GAC AGC ATG TTG GGC GAC TTC 768 
Leu lie Leu Lya Ala Ser Glu Gin Ala Aap Ser Met Leu Gly Aap Phe 
165 170 175 

CTG GAC AGC GAC TGT ACC ACG GGC AGC GGC TCG GGG CTC CCC TTC TTG 816 
Leu Asp Ser Asp Cya Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu 
180 185 190 

GTG CAG AGG ACG GTA GCT CGG CAG GTT CCG CTG CTA GAG TGT GTG GGA 864 
Val Gin Arg Thr Val Ala Arg Gin Val Ala Leu Val Glu Cya Val Gly 
195 200 205 

AAG GGC CCA TAT GGC GAG GTG TGG CGC GGT TCG TGG CAT GGC GAA AGC 912 
Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp His Gly Glu Ser 
210 215 220 

GTG GCG GTC AAC ATT TTC TCC TCA CGA GAT GAG CAG TCC TGC TTC CCG 960 
Val Ala Val Lya lie Phe Ser Ser Arg Aap Glu Gin Ser Trp Phe Arg 
225 230 235 240 

GAG ACG GAG ATC TAC AAC ACA GTT CTG CTT AGA CAC GAC AAC ATC CTA 1008 
Glu Thr Glu lie Tyr Aan Thr Val Leu Leu Arg His Asp Asn lie Leu 
245 250 255 
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GGC TTC ATC CCC TCC GAC ATG ACT TCC CGC AAC TCO AGC ACC CAG CTG 
Glv Phe He Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gin Leu 
260 265 270 

TGG CTC ATC ACC CAC TAC CAT GAA CAC GGC TCC CTC TAT GAC TTT CTG 
Trp Leu He Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu 
275 280 285 

CAG AGG CAG ACG CTG CAG CCC CAG TTC CCC CTG AGG CTA GCT GTG TCC 
Gin Arg Gin Thr Leu Glu Pro Gin Leu Ala Leu Arg Leu Ala Val Ser 
290 295 300 

CCC CCC TGC GGC CTC GCG CAC CTA CAT GTC GAC ATC TTT GGC ACT CAA 
Pro Ala Cys Gly Leu Ala His Leu His Val Glu He Phe Cly Thr Gin 
305 310 315 320 

GGC AAA CCA CCC ATT CCC CAT CGT GAC CTC AAC AGT CGC AAT GTC CTC 
Glv Lvs Pro Ala He Ala His Arg Asp Leu Lys Ser Arg Asn Val Leu 
325 330 335 

GTC AAG AGT AAC TTG CAG TGT TGC ATT GCA GAC CTC GGA CTC GCT GTC 
Val Lys Ser Asn Leu Gin Cys Cys He Ala Asp Leu Cly Leu Ala Val 
340 345 350 

ATG CAC TCA CAA AGC AAC GAC TAC CTC GAT ATC GGC AAC ACA CCC CGA 
Met His Ser Gin Ser Asn Glu Tyr Leu Asp He Gly Asn Thr Pro Arg 
355 360 365 

GTG CGT ACC AAA ACA TAC ATG GCA CCC GAC GTC CTC GAT CAC CAC ATC 
Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu His He 
370 375 380 

CGC ACA GAC TGC TTT GAG TCG TAC AAG TGG ACA GAC ATC TGC GCC TTT 
Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp He Trp Ala Phe 
385 390 395 400 

GGC CTA CTG CTA TGC CAG ATC GCC CCC CCC ACC ATC ATC AAT GGC ATT 
Gly Leu Val Leu Trp Glu He Ala Arg Arg Thr Ho He Asn Gly He 
405 410 415 

GTG GAG GAT TAC ACC CCA CCT TTC TAT GAC ATC CTA CCC AAT CAC CCC 

Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Met Val Pro Aan Asp Pro 
420 425 430 

AGT TTT GAC CAC ATG AAA AAG GTC CTG TCC GTT GAC CAG CAG ACA CCC 
Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gin Gin Thr Pro 
435 440 445 

ACC ATC CCT AAC CCC CTC CCT GCA GAT CCC GTC CTC TCC GGC CTG GCC 
Thr He Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala 
450 455 460 

CAG ATG ATG ACA GAC TGC TGG TAC CCC AAC CCC TCT GCT CGC CTC ACC 
Gin Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr 
465 470 475 480 

GCA CTG CGC ATA AAC AAG ACA TTG CAC AAC CTC AGT CAC AAT CCA GAG 
Ala Leu Arg He Lys Lvs Thr Leu Gin Lys Leu Ser His Asn Pro Glu 
485 490 495 



1056 
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AAG CCC AAA GTG ATT CAC TAGCCCACCC CCACCAGGCT TCCTCTCCCT 1776 
Lye Pro Lys Val Zla Hla 
500 

AAAGTGTGTG CTGGGGAAGA AGACATAGCC TGTCTGGGTA GAGGGAGTGA AGAGACTGTC 1836 
CACGCTGCCC TGTGTGTGCC TGCTCAGCTT GCTCCCAGCC CATCCAGCCA AAAATACAGC 1896 
TGAGCTGAAA TTCAAAAAAA AAAAAA 1922 

(2) INFORMATION FOR SEQ ZD NO: 12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acida 

(B) TYPE: aiaino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECUI.E TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Thr Leu Gly Ser Phe Arg Arg Gly Leu" Leu Met Leu Ser Val Ala 
15 10 15 

Leu Gly Leu Thr Gin Gly Arg Leu Ala Lya Pro Ser Lya Leu Val Asn 
20 25 30 

Cys Thr Cys Glu Ser Pro His Cys Lys Arg Pro Phe Cya Gin Gly Ser 
35 40 45 

Trp Cys Thr Val Val Leu Val Arg Glu Gin Gly Arg His Pro Gin Val 
50 55 60 

Tyr Arg Gly Cys Gly Ser Leu Asn Gin Glu Leu Cya Leu Gly Arg Pro 
65 70 75 80 

Thr Glu Phe Leu Asn His His Cys Cys Tyr Arg Ser Phe Cys Asn His 

85 90 95 

Asn Val Ser Leu Met Leu Glu Ala Thr Gin Thr Pro Ser Glu Glu Pro 
100 105 110 

Glu Val Asp Ala His Leu Pro Leu Zle Leu Gly Pro Val Leu Ala Leu 
115 120 125 

Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg 
130 135 140 

Arg Gin Glu Lys Gin Arg Asp Leu His Ser Asp Leu Gly Glu Ser Ser 
145 150 155 160 

Leu lie Leu Lys Ala Ser Glu Gin Ala Asp Ser Met Leu Gly Asp Phe 
165 170 175 

Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu 
180 185 190 
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Val Gin Arg Thr Val Ala Arg Gin Val Ala Lou Val Glu Cys Val Gly 
195 200 205 

Lya Gly Arg Tyr Gly Glu Val Trp Arg Gly Sar Trp His Gly Glu Ser 
210 215 220 

Val Ala Val Lye lie Phe Ser Ser Arg Aep Glu Gin Ser Trp Phe Arg 
225 230 235 240 

Glu Thr Glu lie Tyr Acn Thr Val Leu Leu Arg His Asp Asn lie Leu 
245 250 255 

Gly Phe lie Ala Ser Asp Met: Thr Ser Arg Asn Ser Ser Thr Gin Leu 
260 265 270 

Trp Leu lie Thr His Tvr His Glu His Gly Ser Leu Tyr Asp Phe Leu 
275 ' 280 285 

Gin Arg Gin Thr Leu Glu Pro Gin Leu Ala Leu Arg Leu Ala Val Ser 
290 295 300 

Pro Ala Cys Gly Leu Ala His Leu His Val Glu lie Phe Gly Thr Gin 
305 310 315 320 

Gly Lys Pro Ala lie Ala His Arg Asp Leu Lys Ser Arg Asn Val Leu 
325 330 335 

Val Lys Ser Aan Leu Gin Cys Cys lie Ala Asp Leu Gly Leu Ala Val 
340 345 350 

Met: His Ser Gin Ser Asn Glu Tyr Leu Asp lie Gly Asn Thr Pro Arg 
355 360 365 

Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu His lie 
370 375 380 

Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp lie Trp Ala Phe 
385 390 395 400 

Gly Leu Val Leu Trp Glu lie Ala Arg Arg Thr lie lie Asn Gly lie 
405 410 415 

Val Glu Asp Tyr Arg Pro Pro Phe Tyr Aep Met Val Pro Asn Asp Pro 
420 425 430 

Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gin Gin Thr Pro 
435 440 445 

Thr lie Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala 
450 455 460 

Gin Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr 
465 470 475 480 

Ala Leu Arg lie Lys Lys Thr Leu Gin Lys Leu Ser His Asn Pro Glu 
485 490 495 

Lys Pro Lys Val lie His 
500 
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65 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2070 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL! NO 

(iii) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 217.. 1812 



PCT/GB93/02367 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTCGGAGAA ATTGGAACTA CAGTTTTATC 60 

TAGCCACATC TCTGAGAATT CTGAAGAAAG CAGCAGGTGA AAGTCATTGC CAAGTGATTT 120 

TGTTCTGTAA GGAAGCCTCC CTCATTCACT TACACCAGTG AGACAGCAGO ACCAGTCATT 180 

CAAAGGGCCG TGTACAGGAC GCGTGGCAAT CAGACA ATG ACT CAG CTA TAC ACT 234 

Met Thr Gin Leu Tyr Thr 
1 5 

TAC ATC AGA TTA CTG CGA GCC TGT CTG TTC ATC ATT TCT CAT GTT CAA 282 
Tyr lie Aro Leu Leu Gly Ala Cya Leu Phe lie lie Ser His Val Gin 
10 15 20 

GGG CAG AAT CTA GAT AGT ATC CTC CAT GGC ACT GGT ATG AAA TCA GAC 330 
Gly Gin Asn Leu Asp Ser Met Leu His Gly Thr Gly Met Lya Ser Asp 
25 30 35 

TTG GAC CAG AAG AAG CCA GAA AAT GGA GTG ACT TTA GCA CCA CAG GAT 378 
Leu Asp Gin Lya Lya Pro Glu Aan Gly Val Thr Leu Ala Pro Glu Asp 
40 45 50 

ACC TTG CCT TTC TTA AAG TGC TAT TGC TCA GGA CAC TGC CCA GAT GAT 426 
Thr Leu Pro Phe Leu Lya Cya Tyr Cya Ser Gly His Cya Pro Asp Aap 
55 60 65 70 

GCT ATT AAT AAC ACA TGC ATA ACT AAT GGC CAT TGC TTT GCC ATT ATA 474 
Ala lie Aan Aan Thr Cya lie Thr Aan Gly Hia Cya Phe Ala lie lie 
75 80 85 

GAA GAA GAT GAT CAG GGA GAA ACC ACA TTA ACT TCT GGG TGT ATG AAG 522 
Glu Glu Asp Asp Gin Gly Glu Thr Thr Leu Thr Ser Gly Cys Met Lya 
90 95 100 
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TAT GAA GGC TCT GAT TTT CAA TGC AAC GAT TCA CCC AAA GCC CHG CTA 570 
Tvr Glu Gly Ser Aap Pho Gin Cyo Ly» Asp Ser Pro Lya Ala Gin Leu 
105 110 115 

CGC AGG ACA ATA GAA TGT TGT CGG ACC AAT TTG TGC AAC CAG TAT TTG 618 
Aro Ara Thr lie Glu Cya Cys Arg Thr Aon Leu Cya Aar. Gin Tyr Lou 
120 125 130 

CAG OCT ACA CTG CCC CCT GTT GTT ATA GGT CCC TTC TTT GAT GGC AGC 666 
Gin Pro Thr Leu Pro Pro Val Val lie Gly Pro Phe Phe Aap Gly Ser 
135 140 145 150 

ATC CGA TGG CTG GTT GTG CTC ATT TCC ATG GCT GTC TGT ATA GTT GCT 714 
lie Ara Trp Leu Val Val Leu lie Ser Met Ala Val Cya lie Val Ala 
ISS 160 165 

ATG ATC ATC TTC TCC AGC TGC TTT TGC TAT AAG CAT TAT TGT AAG AGT 762 
Met lie lie Phe Ser Ser Cya Phe Cya Tyr Lya Hla Tyr Cya Lya Ser 
170 175 180 

ATC TCA ACC ACC CCT CCT TAG AAC CGT CAT TTG GAA CAG CAT GAA CCA 810 
lie Ser Ser Arg Gly Arg Tyr Aan Arg Aap Leu Glu Gin Aap Glu Ala 
185 190 195 

TTT ATT CCA GTA GCA GAA TCA TTC AAA GAC CTG ATT GAC CAG TCC CAA 858 
Phe lie Pro Val Gly Glu Ser Leu Lya Aap Leu lie Aap Gin Ser Gin 
200 205 210 

AGC TCT GGG ACT CGA TCT CGA TTG CCT TTA TTC GTT CAG CGA ACT ATT 906 
Ser Ser Cly Ser Gly Ser Gly Leu Pro Leu Leu Val Gin Arg Thr lie 
215 220 225 230 

GCC AAA CAG ATT CAG ATC GTT CGC CAG GTT GGT AAA GGC CGC TAT GCA 954 
Ala Lya Gin lie Gin Met Val Arg Gin Val Gly Lya Cly Arg Tyr Gly 
235 240 245 

GAA GTA TGC ATG GCT AAA TCC CCT GCT GAA AAA CTC GCT GTC AAA GTG 1002 
Glu Val Trp Met Gly Lya Trp Arg Gly Glu Lya Val Ala Val Lya Val 
250 255 260 

TTT TTT ACC ACT GAA GAA GCT AGC TGG TTT ACA GAA ACA GAA ATC TAC 1050 
Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe Arg Glu Thr Glu lie Tyr 
265 270 275 

CAG ACC GTG TTA ATC CCT CAT GAA AAT ATA CTT GCT TTT ATA GCT CCA 1098 
Gin Thr Val Leu Met Arg Hie Glu Aan He Leu Gly Phe He Ala Ala 
280 285 290 

GAC ATT AAA GGC ACT GGT TCC TGC ACT CAG CTG TAT TTC ATT ACT CAT 1146 
Aap He Lya Gly Thr Gly Ser Trp Thr Gin Leu Tyr Leu He Thr Aap 
295 300 305 310 

TAC CAT GAA AAT CGA TCT CTC TAT GAC TTC CTG AAA TGT GCC ACA CTA 1194 
Tyr Hia Glu Aan Gly Ser Leu Tyr Aap Phe Leu Lya Cya Ala Thr Leu 
315 320 325 

GAC ACC ACA GCC CTA CTC AAG TTA GCT TAT TCT GCT GCT TGT GGT CTG 1242 
Aap Thr Arg Ala Leu Leu Lya Leu Ala Tyr Ser Ala Ala Cya Cly Leu 
330 335 340 
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TGC CAC CTC CAC ACA CAA ATT TAT COT ACC CAA CGG AAC CCT CCA ATT 1290 
Cys His Leu His Thr Clu Ilo Tyr Gly Thr Gin Gly Lya Pro Ala lie 
34S 350 355 

OCT CAT CGA GAC CTG AAG AGC AAA AAC ATC CTT ATT AAG AAA AAT GGA 1338 
Ala Hie Arg Asp Leu Lya Ser Lya Aan lie Leu lie Lya Lya Aan Gly 
360 365 370 

AGT TGC TGT ATT GCT CAC CTC GGC CTA CCT GTT AAA TTC AAC ACT GAT 1386 
Ser Cya Cya lie Ala Asp Leu Gly Leu Ala Val Lya Phe Aan Ser Aap 
375 380 385 390 

ACA AAT GAA CTT CAC ATA CCC TTC AAT ACC AGC GTG GGC ACC AAC CGG 1434 

Thr Asn Glu Val Aap lie Pro Leu Aan Thr Arg Val Gly Thr Lya Arg 
395 400 405 

TAC ATC GCT CCA GAA GTG CTG GAT GAA AGC CTC AAT AAA AAC CAT TTC 1482 
Tyr Met Ala Pro Clu Val Leu Aap Glu Ser Leu Aan Lya Aan Hia Phe 
410 415 420 

GAG CCC TAC ATC ATC CCT GAC ATC TAT ACC TTT CCT TTC ATC ATT TGC 1530 
Gin Pro Tyr lie Met Ala Aap lie Tyr Ser Phe Gly Leu lie lie Trp 
425 430 435 

GAA ATC GCT CCT CCT TGT ATT ACA GGA CGA ATC GTG GAC GAA TAT CAA 1578 
Glu Met Ala Arg Arg Cya lie Thr Gly Gly lie Val Glu Glu Tyr Gin 
440 445 450 

TTA CCA TAT TAC AAC ATC CTC CCC AGT GAC CCA TCC TAT GAC GAC ATC 1626 
Leu Pro Tyr Tyr Aan Met Val Pro Ser Aap Pro Ser Tyr Clu Asp Met 
455 460 465 470 

CCT GAG GTT GTG TGT CTC AAA CGC TTG CCC CCA ATC GTG TCT AAC CCC 1674 
Arg Glu Val Val Cya Val Lya Arg Leu Arg Pro lie Val Ser Aan Arg 
475 480 485 



TGC AAC AGC GAT GAA TGT CTT CGA CCA GTT TTG AAG CTA ATC TCA GAA 1722 

Trp Aan Ser Aap Glu Cya Leu Arg Ala Val Leu Lya Leu Met Ser Glu 
490 495 500 

TGT TGC CCC CAT AAT CCA CCC TCC AGA CTC ACA GCT TTC ACA ATC AAG 1770 

Cya Trp Ala Hia Aan Pro Ala Ser Arg Leu Thr Ala Leu Arg lie Lya 

505 510 515 



AAG ACA CTT CCA AAA ATC GTT CAA TCC CAC GAT CTA AAG ATT 1812 
Lya Thr Leu Ala Lya Met Val Glu Ser Gin Asp Val Lya lie 
520 525 530 



TGACAATTAA 


ACAATTTTGA 


CGGAGAATTT 


AGACTGCAAG 


AACTTCTTCA 


CCCAACGAAT 


1872 


GGGTGGGATT 


ACCATCCAAT 


ACCATCTTGA 


CTTGGTTTCC 


AGACTCCTTC 


CTCTACATCT 


1932 


TCACAGCCTG 


CTAACACTAA 


ACCTTACCCT 


ACTCTACACA 


ATACAAGATT 


GGAACTTCGA 


1992 


ACTTCAAACA 


TGTCATTCTT 


TATATATCAC 


AGCTTTGTTT 


TAATGTGGGC 


TTTTTTTGTT 


2052 


TGCTTTTTTT 


GTTTTCTT 










2070 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SSQUBNCB CHARACTERISTICS: 

(A) LENGTH t 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOIXX2Y: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Thr Gin Leu Tyr Thr Tyr lie Arg Leu Leu Gly Ala Cys Leu Phe 
IS 10 IS 

lie lie Ser His Val Gin Gly Gin Asn Leu Asp Ser Met Leu His Gly 
20 25 30 

Thr Gly Met Lys Ser Aap Leu Asp Gin Lys Lya Pro Glu Asn Gly Val 
35 40 45 

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cya Tyr Cys Ser 
SO 55 60 

Gly His Cys Pro Asp Asp Ala lie Asn Asn Thr Cys lie Thr Asn Gly 
65 70 75 80 

His Cys Phe Ala lie lie Glu Glu Asp Asp Gin Gly Glu Thr Thr Leu 

85 90 95 

Thr Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gin Cys Lys Asp 
100 105 110 

Ser Pro Lys Ala Gin Leu Arg Arg Thr lie Glu Cys Cys Arg Thr Asn 
115 120 125 

Leu Cys Asn Gin Tyr Leu Gin Pro Thr Leu Pro Pro Val Val lie Gly 
130 135 140 

Pro Phe Phe Asp Gly Ser lie Arg Trp Leu Val Val Leu lie Ser Met 
145 150 155 160 

Ala Val Cys lie Val Ala Met lie lie Phe Ser Ser Cys Phe Cys Tyr 
165 170 175 

Lys His Tyr Cys Lys Ser lie Ser Ser Arg Gly Arg Tyr Asn Arg Asp 
180 185 190 

Leu Glu Gin Asp Glu Ala Phe lie Pro Val Gly Glu Ser Leu Lys Asp 
195 200 205 

Leu lie Asp Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu 
210 215 220 

Leu Val Gin Arg Thr lie Ala Lys Gin lie Gin Met Val Arg Gin Val 
225 230 235 240 

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu 
245 250 255 
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Lye Val 



Ala 



Val Lys Val 
260 



Phe Phe Thr Thr Glu Glu Ala Ser 
265 270 



Trp 



Phe 



Arg Glu Thr Glu lie Tyr Gin Thr Val Leu Met Arg His Glu Asn lie 
275 280 285 

Leu Gly Phe lie Ala Ala Asp lie Lys Gly Thr Gly Ser Trp Thr Gin 
290 295 300 

Leu Tyr Leu lie Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe 
305 310 315 320 

Leu Lvs Cvs Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr 
325 330 335 

Ser Ala Ala Cys Cly Leu Cys His Lou His Thr Glu lie Tyr Gly Thr 
340 345 350 

Gin Gly Lys Pro Ala lie Ala His Arg Asp Leu Lys Ser Lys Asn lie 
355 360 365 

Leu lie Lys Lys Asn Gly Ser Cys Cys lie Ala Asp Leu Gly Leu Ala 
370 375 380 

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp lie Pro Leu Asn Thr 
385 390 395 400 

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser 
405 410 415 

Leu Asn Lys Asn His Phe Gin Pro Tyr lie Met Ala Asp lie Tyr Ser 
420 425 430 

Phe Gly Leu lie lie Trp Glu Met Ala Arg Arg Cys lie Thr Gly Gly 
435 440 445 

lie Val Glu Glu Tyr Gin Leu Pro Tyr Tyr Asn Mot Val Pro Ser Asp 
450 455 460 

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg 
465 470 475 480 

Pro lie Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val 
485 490 495 

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu 
500 505 510 

Thr Ala Leu Arg lie Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gin 
515 520 525 

Asp Val Lys lie 



530 



(2) INFORMATION FOR SEQ 



ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 base pairs 
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(B) TYPBt nucleic acid 

( C ) STRANDEDNZSS : unknown 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 

(ill) HYPOTHETICAL: NO 

(ill) ANTI-SENSE t NO 

(V) FRAGKENT TYPE: Internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

( ix ) FEATURE : 

(A) NAME/KEY: COS 

(B) LOCATION: 10.. 1524 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGCGGTTAC ATG GCC GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT 48 
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu 
15 10 

GTT GTC CTC CTG CTC GCC GGC AGC GCC GGG TCC GGG CCC CGC GGG ATC 96 

Val Val Leu Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly lie 
15 20 25 

CAG GCT CTG CTG TGT GCG TGC ACC AGC TGC CTA CAC ACC AAC TAC ACC 144 
Gin Ala Leu Leu Cys Ala Cys Thr Ser Cye Leu Gin Thr Asn Tyr Thr 
30 35 40 45 

TGT GAG ACA GAT GGG GCT TGC ATG GTC TCC ATC TTT AAC CTG GAT GGC 192 
Cys Glu Thr Asp Gly Ala Cys Met Val Ser lie Phe Asn Leu Asp Gly 
50 55 60 

CTG GAG CAC CAT GTA CCT ACC TGC ATC CCC AAG GTG GAG CTG GTT CCT 240 
Val Glu His His Val Arg Thr Cys lie Pro Lys Val Glu Leu Val Pro 
65 70 75 

GCT GGA AAG CCC TTC TAC TGC CTG ACT TCA GAG GAT CTC CGC AAC ACA 288 
Ala Gly Lys Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr 
80 85 90 

CAC TGC TGC TAT ATT CAC TTC TGC AAC AAG ATT GAC CTC AGC GTC CCC 336 
His Cys Cys Tyr lie Asp Phe Cys Asn Lys lie Asp Leu Arg Val Pro 
95 100 105 

AGC GGA CAC CTC AAG GAG CCT GCG CAC CCC TCC ATG TGG GGC CCT GTG 384 
Ser Gly His Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val 
110 115 120 125 

GAG CTG GTC GGC ATC ATC GCC GGC CCC GTC TTC CTC CTC TTC CTT ATC 432 
Glu Leu Val Gly lie lie Ala Gly Pro Val Phe Leu Leu Phe Leu lie 
130 135 140 
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ATT ATC ATC CTC 
lie lie lie Val 
145 



TTC CTC CTC ATC AAC TAT CAC 
Phe Leu Val lie Asn Tyr Hie 
150 



CAG CCT CTC TAC CAT 
Gin Arg Val Tyr His 
155 



480 



AAC CGC CAG AGG 
Asn Arg Gin Arg 
160 



TTG GAC ATG GAG GAC CCC TCT 
Leu Asp Met Glu Asp Pro Ser 
165 



TGC GAG ATC TCT CTC 
Cys Glu Met Cys Leu 
170 



528 



TGC AAA GAC AAC 
Ser Lys Asp Lya 
175 



ACG CTC CAG GAT CTC CTC TAC 
Thr Leu Gin Asp Leu Val Tyr 
180 



GAC CTC TCC ACG TCA 
Asp Leu Ser Thr Ser 
185 



576 



GGG TCT GGC TCA 
Gly Ser Gly Ser 
190 



GGG TTA CCC CTT TTT CTC CAG 
Gly Leu Pro Leu Phe Val Gin 
195 200 



CGC ACA GTG CCC CCA 
Arg Thr Val Ala Arg 
205 



624 



ACC ATT CTT TTA 
Thr lie Val Leu 



CAA GAG ATT ATC GGC AAG GGC 
Gin Glu lie lie Gly Lys Gly 
210 215 



CGG TTC GGG GAA GTA 
Arg Phe Gly Glu Val 
220 



672 



TGG CGT GGT CGC 
Trp Arg Gly Arg 
225 



TGG AGG GGT GGT GAC GTG CCT 
Trp Arg Gly Gly Asp Val Ala 
230 



GTG AAA ATC TTC TCT 
Val Lys lie Phe ser 
235 



720 



TCT CGT GAA GAA 
Ser Arg Glu Glu 
240 



CGG TCT TGG TTC CGT GAA CCA 
Arg Ser Trp Phe Arg Glu Ala 
245 



CAG ATC TAC CAG ACC 
Glu lie Tyr Gin Thr 
250 



768 



GTC ATG CTC CGC 
Val Met Leu Arg 
255 



CAT GAA AAC ATC CTT GGC TTT 
His Glu Asn Zle Leu Gly Phe 
260 



ATT GCT GCT GAC AAT 
lie Ala Ala Asp Asn 
265 



816 



AAA GAT AAT GGC 
Lys Asp Asn Gly 
270 



ACC TGG ACC CAG CTC TGG CTT 
Thr Trp Thr Gin Leu Trp Leu 
275 280 



GTC TCT GAC TAT CAC 
Val Ser Asp Tyr His 
285 



864 



GAG CAT GGC TCA 
Glu His Gly Ser 



CTG TTT GAT TAT CTG AAC CGC 
Leu Phe Asp Tyr Leu Asn Arg 
290 295 



TAC ACA GTC ACC ATT 
Tyr Thr Val Thr lie 
300 



912 



GAG GGA ATG ATT 
Glu Gly Met lie 
305 



AAG CTA CCC TTG TCT CCA CCC 
Lys Leu Ala Leu Ser Ala Ala 
310 



ACT GGT TTG GCA CAC 
Ser Gly Leu Ala His 
315 



960 



CTG CAT ATG GAG 
Leu His Met Glu 
320 

CGA GAC TTG AAG 
Arg Asp Leu Lys 
335 



ATT GTG GGC ACT CAA GGG AAG 
lie Val Gly Thr Gin Gly Lya 
325 

TCA AAG AAC ATC CTC GTG AAA 
Ser Lys Asn lie Leu Val Lys 
340 



CCG GGA ATT GCT CAT 1008 
Pro Gly lie Ala His 
330 

AAA AAT GGC ATG TCT 1056 

Lys Asn Gly Met Cys 

345 



GCC ATT GCA CAC 

Ala lie Ala Asp 
350 

ACC ATA GAC ATT 
Thr lie Asp lie 



CTG GGC CTG GCT GTC CGT CAT 
Leu Gly Leu Ala Val Arg His 
355 360 

GCT CCA AAT CAG AGG GTG GGG 
Ala Pro Asn Gin Arg Val Cly 
370 375 



GAT GCC GTC ACT GAC 1104 
Asp Ala Val Thr Asp 
365 

ACC AAA CGA TAC ATG 1152 
Thr Lys Arg Tyr Met 
380 
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GCT CCT GAA GTC CTT GAC GAG ACA ATC AAC ATG AAG CAC TTT GAC TCC X200 

Ala Pro Glu Val Leu Aap Glu Thr He Asn Met Lya His Phe Asp Ser 
385 390 395 

TTC AAA TOT GCC GAC ATC TAT GCC CTC GGC CTT GTC TAC TGO GAO ATT 1248 
Phe Lya Cya Ala Aap Ho Tyr Ala Leu Gly Leu Val Tyr Trp Glu He 
400 405 410 

GCA CGA ACA TGC AAT TCT GGA CGA GTC CAT GAA GAC TAT CAA CTG CCC 1296 

Ala Arg Arg Cya Aan Sar Gly Gly Val His Glu Asp Tyr Gin Leu Pro 
415 420 425 

TAT TAC GAC TTA GTG CCC TCC GAC CCT TCC ATT GAG GAG ATC CGA AAG 1344 
Tyr Tyr Aap Leu Val Pro Ser Aap Pro Ser He Glu Glu Met Arg Lya 
430 435 440 445 

GTT GTA TGT GAC CAG AAG CTA COG CCC AAT GTC CCC AAC TGG TGG CAG 1392 
Val Val Cya Asp Gin Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gin 
450 455 460 

AGT TAT GAG GCC TTG CGA GTG ATG GGA AAG ATG ATG CGG GAG TGC TGG 1440 
Ser Tyr Glu Ala Leu Arg Val Met Gly Lya Met Met Arg Glu Cya Trp 
465 470 475 

TAC GCC AAT GGT GCT GCC CGT CTG ACA GCT CTG CGC ATC AAG AAG ACT 1488 
Tyr Ala Aan Gly Ala Ala Arg Leu Thr Ala Leu Arg He Lya Lya Thr 
480 485 490 

CTG TCC CAG CTA AGC GTG CAG GAA GAT GTG AAG ATT TAACCTCTTC 1534 
Leu Ser Gin Leu Ser Val Gin Glu Aap Val Lya He 
495 500 505 



CTCTGCCTAC 


ACAAAGAACC 


TGGCCAGTGA 


GGATGACTGC 


AGCCACCGTG 


CAAGCGTCGT 


1594 


GGAGGCCTAT 


CCTCTTGTTT 


CTGCCCGGCC 


CTCTGGCAGA 


CCCCTCCCCT 


GCAAGAGGGA 


1654 


CAGAGCCTGG 


GAGACCCGCG 


CACTCCCG TT 


GGGTTTGAGA 


CAGACACTTT 


TTATATTTAC 


1714 


CTCCTGATGG 


CATGGAGACC 


TGAGCAAATC 


ATGTAGTCAC 


TCAATGCCAC 


AACTCAAACT 


1774 


GCTTCAGTGG 


GAAGTACAGA 


GACCCAGTGC 


ATTGCGTGTG 


CAGGAGCGTG 


AGGTGCTGGG 


1834 


CTCGCCAGGA 


GCGGCCCCCA 


TACCTTGTGG 


TCCACTCCGC 


TGCAGGTTTT 


CCTCCAGGGA 


1894 


CCAGTCAACT 


GGCATCAAGA 


TATTGAGAGG 


AACCGGAAGT 


TTCTCCCTCC 


TTCCCGTAGC 


1954 


AGTCCTGAGC 


CACACCATCC 


TTCTCATGCA 


CATCCGGAGG 


ACTGCCCCTA 


GAGACACAAC 


2014 


CTGCTGCCTC 


TCTCTCCACC 


CAAGTGCGCA 


TGTGCCGAGG 


TGTGTCCCAC 


ATTGTGCCTG 


2074 


GTCTGTGCCA 


CGCCCGTGTG 


TGTGTCTGTG 


TGTGTGAGTG 


AGTGTCTGTG 


TGTACACTTA 


2134 


ACCTGCTTGA 


GCTTCTGTGC 


ATGTGT 








2160 



(2) INFORMATION FOR SEQ ID NOt 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 505 amino acida 
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(B) TYPBt amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met: Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
15 10 IS 

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly lie Gin Ala Leu 
20 25 30 

Lou Cya Ala Cys Thr Ser Cya Leu Gin Thr Asn Tyr Thr Cys Glu Thr 
35 40 45 

Asp Gly Ala Cys Met Val Ser lie Phe Asn Leu Asp Gly Val Glu His 
50 55 60 

Hia Val Arg Thr Cya lie Pro Lye Val Glu Leu Val Pro Ala Gly Lys 
65 70 75 80 

Pro Phe Tyr Cya Leu Ser Ser Glu Asp Leu Arg Asn Thr Hia Cya Cya 
85 90 95 

Tyr lie Asp Phe Cya Aan Lya lie Aap Leu, Arg Val Pro Ser Gly Hia 
100 105 110 

Leu Lya Glu Pro Ala Hia Pro Ser Met Trp Gly Pro Val Glu Leu Val 
115 120 125 

Gly lie lie Ala Gly Pro Val Phe Leu Leu Phe Leu lie He He He 
130 135 140 

Val Phe Leu Val He Asn Tyr His Gin Arg Val Tyr His Asn Arg Gin 
145 ISO 155 160 

Arg Leu Asp Met Glu Aap Pro Ser Cya Glu Met Cya Leu Ser Lya Aap 
165 170 175 

Lya Thr Leu Gin Aap Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly 
180 185 190 

Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val Ala Arg Thr He Val 
195 200 205 

Leu Gin Glu He He Gly Lya Gly Arg Phe Gly Glu Val Trp Arg Gly 
210 215 220 

Arg Trp Arg Gly Gly Asp Val Ala Val Lya He Phe Ser Ser Arg Glu 
225 230 235 240 

Glu Arg Ser Trp Phe Arg Glu Ala Glu He Tyr Gin Thr Val Met Leu 
245 250 255 

Arg Hia Glu Asn He Leu Gly Phe He Ala Ala Asp Asn Lys Aap Aan 
260 265 270 
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Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Aep Tyr His Glu Hi* Gly 
275 280 28S 

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr VaX Thr lie Glu Gly Met: 
290 295 300 

lie Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met 
305 310 315 320 

Glu lie Val Gly Thr Gin Gly Lys Pro Gly lie Ala His Arg Asp Leu 
325 330 335 

Lys Ser Lys Asn lie Leu Val Lys Lys Asn Gly Met Cys Ala lie Ala 
340 345 350 

Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr lie Asp 
355 360 365 

lie Ala Pro Asn Gin Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu 
370 375 380 

Val Leu Asp Glu Thr lie Asn Met Lys His Phe Asp Ser Phe Lys Cys 
385 390 395 400 

Ala Asp lie Tyr Ala Leu Gly Leu Val Tyr Trp Glu lie Ala Arg Arg 
405 410 415 

Cya Asn Ser Gly Gly Val His Glu Asp Tyr Gin Leu Pro Tyr Tyr Asp 
420 425 430 

Leu Val Pro Ser Asp Pro Ser lie Glu Glu Met Arg Lys Val Val Cys 
435 440 445 

Asp Gin Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gin Ser Tyr Glu 
450 455 460 

Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn 
465 470 475 480 

Gly Ala Ala Arg Leu Thr Ala Leu Arg lie Lys Lys Thr Leu Ser Gin 
485 490 495 

Leu Ser Val Gin Glu Asp Val Lys lie 
500 505 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1952 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 
( O ) TOPOLOGY : unluiown 

(ii) MOLECULE TYPE: cONA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 
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(V) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mousa 

( ix ) FEATURE : 

(A) NAME/KEY; CDS 

(B) LOCATION: 187.. 1692 



(xl) SEQUENCE DESCRIPTION: SEQ ID HO: 17: 

AAGCGGCGGC AGAAGTTGCC GGCGTGGTGC TCGTAGTGAG GGCGCGGAGG ACCCCGGACC 60 

TGGGAAGCGG CGGCGGGTTA ACTTCGGCTG AATCACAACC ATTTGGCGCT GAGCTATGAC 120 

AAGAGAGCAA ACAAAAAGTT AAAGGAGCAA CCCGGCCATA AGTGAAGAGA GAACTTTATT 180 

GATAAC ATG CTC TTA CGA AGO TCT GGA AAA TTA AAT GTG GGC ACC AAG 228 
Met Leu Leu Arg Ser Ser Gly Lya Leu Asn Val Gly Thr Lya 
15 10 

AAG GAG GAT GGA GAG ACT ACA GCC CCC ACC CCT CGG CCC AAG ATC CTA 276 
Lye Glu Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys lie Leu 
15 20 25 30 

CGT TGT AAA TGC CAC CAC CAC TGT CCG GAA GAC TCA GTC AAC AAT ATC 324 
Arg Cys Lya Cya Hia Hia Hia Cya Pro Glu Aap Ser Val Aan Aan lie 
35 40 45 

TGC AGC ACA GAT GGG TAC TGC TTC ACG ATG ATA GAA GAA GAT GAC TCT 372 
Cys Ser Thr Aap Gly Tyr Cya Phe Thr Met lie Glu Glu Asp Aap Ser 
50 55 60 

GGA ATG CCT GTT GTC ACC TCT GGA TGT CTA GGA CTA GAA GGC TCA GAT 420 
Gly Met Pro Val Val Thr Ser Gly Cya Leu Gly Leu Glu Gly Ser Aap 
65 70 75 

TTT CAA TGT CGT GAC ACT CCC ATT CCT CAT CAA AGA AGA TCA ATT GAA 468 
Phe Gin Cya Arg Aap Thr Pro lie Pro Hia Gin Arg Arg Ser lie Glu 
80 85 90 

TGC TGC ACA GAA AGC AAT GAC TGT AAT AAA GAC CTC CAC CCC ACT CTC 516 
Cya Cya Thr Glu Arg Aan Glu Cya Aan Lya Aap Leu Hia Pro Thr Leu 
95 100 105 110 

CCT CCT CTC AAC GAC AGA GAT TTT GTT CAT GGC CCC ATA CAC CAC AAC 564 
Pro Pro Leu Lys Asp Arg Asp Phe Val Asp Gly Pro lie His His Lys 
115 120 125 

GCC TTG CTT ATC TCT CTC ACT GTC TGT ACT TTA CTC TTG GTC CTC ATT 612 
Ala Leu Leu lie Ser Val Thr Val Cya Ser Leu Leu Leu Val Leu lie 
130 135 140 

ATT TTA TTC TGT TAC TTC AGO TAT AAA AGA CAA GAA GCC CGA CCT CGG 660 

lie Leu Phe Cys Tyr Phe Arg Tyr Lys Arg Gin Glu Ala Arg Pro Arg 
145 150 155 
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TAC AGC ATT GGC CTC GAG CAC CAC GAG ACA TAC ATT CCT CCT GGA GAG 708 
Tyr SGr lie Gly Leu Clu Gin Asp Glu Thr Tyr lie Pro Pro Gly Glu 
160 165 170 

TCC CTG ACA GAC TTC ATC GAG CAG TCT CAG AGC TCXS GGA AGT GGA TCA 756 
Ser Leu Arg Asp Leu lie Glu Gin Ser Gin Ser Sor Gly Ser Gly Ser 
175 180 185 190 

GGC CTC CCT CTG CTG GTC CAA AGG ACA ATA GCT AAG CAA ATT CAG ATG 804 

Gly Leu Pro Leu Leu Val Gin Arg Thr lie Ala Lys Gin He Gin Met 
195 200 205 

GTG AAG CAG ATT GGA AAA GGC CGC TAT GGC GAG GTG TGG ATG GGA AAG 852 
Val Lye Gin He Gly Lye Gly Arg Tyr Gly Clu Val Trp Met Gly Lya 
210 215 220 

TGG CGT GGA GAA AAG GTG GCT GTG AAA GTG TTC TTC ACC ACG GAG GAA 900 
Trp Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu 
225 230 235 

GCC AGC TGG TTC CGA GAG ACT GAG ATA TAT CAG ACG GTC CTG ATG CGG 948 
Ala Ser Trp Phe Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg 
240 245 250 

CAT GAG AAT ATT CTC GGC TTC ATT GCT CCA CAT ATC AAA GGC ACT GGC 996 
His Glu Aan He Leu Gly Phe He Ala Ala Asp He Lys Gly Thr Gly 
255 260 265 270 

TCC TGG ACT CAG TTC TAC CTC ATC ACA GAC TAT CAT GAA AAC GCC TCC 1044 
Ser Trp Thr Gin Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser 
275 280 285 

CTT TAT GAC TAT CTC AAA TCC ACC ACC TTA CAC CCA AAC TCC ATC CTC 1092 
Leu Tyr Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu 
290 295 300 

AAC CTA GCC TAC TCC TCT CTC AGC CGC CTA TCC CAT TTA CAC ACC GAA 1140 
Lys Leu Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu 
305 310 315 

ATC TTT AGC ACT CAA CGC AAG CCA GCA ATC GCC CAT CGA GAC TTG AAA 1188 
He Phe Ser Thr Gin Gly Lys Pro Ala He Ala His Arg Asp Leu Lys 
320 325 330 

AGT AAA AAC ATC CTG GTG AAG AAA AAT GGA ACT TCC TCC ATA GCA GAC 1236 
Ser Lya Asn He Leu Val Lys Lys Asn Gly Thr Cys Cys He Ala Aep 
335 340 345 350 

CTG GGC TTC CCT CTC AAG TTC ATT AGT GAC ACA AAT GAG CTT GAC ATC 1284 
Leu Gly Leu Ala Val Lys Phe He Ser Asp Thr Aan Glu Val Asp He 
355 360 365 

CCA CCC AAC ACC CGG CTT GGC ACC AAG CGC TAT ATG CCT CCA GAA GTG 1332 
Pro Pro Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Clu Val 
370 375 380 

CTG GAC GAC ACC TTG AAT ACA AAC CAT TTC CAG TCC TAC ATT ATG GCT 1380 
Leu Asp Glu Ser Leu Asn Arg Asn His Phe Gin Ser Tyr He Met Ala 
385 390 395 
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GAC ATC TAC ACC TTT CCA CTC ATC CTC TGC GAO ATT CCA AGO AGA TGT 1428 
Asp Met: Tyr Ser Phe Gly Leu lie Leu Trp Glu lie Ala Arg Arg Cys 
400 405 410 

GTT TCT GGA GGT ATA CTC GAA GAA TAC CAG CTT CCC TAT CAC GAC CTG 1476 
Val Ser Gly Gly lie Val Glu Glu Tyr Gin Leu Pro Tyr His Asp Leu 
415 420 425 430 

GTG CCC ACT GAC CCT TCT TAT GAG GAC ATG AGA GAA ATT CTG TGC ATC 1524 
Val Pro ser Asp Pro Ser Tyr Glu Asp Met Arg Glu lie Val Cys Hot 
435 440 445 

AAG AAG TTA COG CCT TCA TTC CCC AAT CCA TGG AGC ACT GAT GAG TGT 1572 
Lys Lys Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys 
450 455 460 

CTC AGG CAG ATG GGG AAG CTT ATG ACA GAG TGC TGG CCC CAG AAT CCT 1620 
Leu Arg Gin Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gin Asn Pro 
465 470 475 

GCC TCC AGG CTG ACG GCC CTG AGA GTT AAG AAA ACC CTT GCC AAA ATG 1668 
Ala Ser Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met 
480 485 490 

TCA GAG TCC CAG GAC ATT AAA CTC TGACGTCAGA TACTT G TGGA CAGAGCAAGA 1722 
Ser Glu Ser Gin Asp lie Lys Leu 
495 500 

ATTTCACAGA AGCATCGTTA GCCCAAGCCT TGAACGTTAG CCTACTGCCC AGTCAGTTCA 1782 

GACTTTCCTG GAAGAGAGCA CGGTGGGCAG ACACAGAGGA ACCCAGAAAC ACGGATTCAT 1842 

CATGGCTTTC TGAGGAGGAG AAACTGTTTG GGTAACTTGT TCAAGATATG ATGCATCTTG 1902 

C TTTCT A AGA AAGCCCTGTA TTTTGAATTA CCATTTTTTT ATAAAAAAAA 1952 



<2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys Lys Glu 
1 5 10 15 



Asp Gly Glu Ser Thr Ala Pro Thr 
20 

Lys Cys His His His Cys Pro Glu 

35 40 

Thr Asp Gly Tyr Cys Phe Thr Met 
50 55 



Pro Arg Pro Lys lie Leu Arg Cys 
25 30 

Asp Ser Val Asn Asn lie Cys Ser 
45 

lie Glu Glu Asp Asp Ser Gly Met 
60 
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Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Aep Phe Gin 
65 70 75 80 

Cys Arg Aep Thr Pro lie Pro His Gin Arg Arg Ser lie Glu Cys Cys 
85 90 95 

Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu Pro Pro 
100 105 110 

Leu Lya Asp Arg Asp Phe Val Asp Gly Pro lie His His Lys Ala Leu 
115 120 125 

Leu lie Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu lie lie Leu 
130 135 140 

Phe Cys Tyr Phe Arg Tyr Lys Arg Gin Glu Ala Arg Pro Arg Tyr Ser 
145 150 155 160 

lie Gly Leu Glu Gin Asp Glu Thr Tyr lie Pro Pro Gly Glu Ser Leu 
165 170 175 

Arg Asp Leu lie Glu Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly Leu 
180 185 190 

Pro Leu Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val Lys 
195 200 205 

Gin He Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Txrp Arg 
210 215 220 

Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser 
225 230 235 240 

Trp Phe Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg His Glu 
245 250 255 

Asn He Leu Gly Phe He Ala Ala Aep He Lys Gly Thr Gly Ser Trp 
260 265 270 

Thr Gin Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr 
275 280 285 

Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu Lys Leu 
290 295 300 

Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu He Phe 
305 310 315 320 

Ser Thr Gin Gly Lys Pro Ala He Ala His Arg Asp Leu Lys Ser Lys 
325 330 335 

Asn He Leu Val Lys Lys Asn Gly Thr Cys Cys He Ala Asp Leu Gly 
340 345 350 

Leu Ala Val Lys Phe He Ser Asp Thr Asn Glu Val Asp He Pro Pro 
355 360 365 

Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val Leu Asp 
370 375 380 
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Glu Ser Leu Aan Arg Aan Hia 

385 3S0 

Tyr Ser Phe Gly Leu lie Leu 
405 

Gly Gly lie Val Glu Glu Tyr 
420 



Ser Aap Pro Ser Tyr Glu Aap 
435 



Leu Arg Pro Ser Phe Pro Aan 
450 455 

Gin Met Gly Lya Leu Met Thr 
465 470 

Arg Leu Thr Ala Leu Arg Val 
485 



Ser Gin Aap lie Lya Leu 
500 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pair a 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCCGATCCTG TTGTGAAGGN AATATGTG 28 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: a ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 



Phe Gin Ser 



Trp Glu lie 
410 



Gin Leu Pro 
425 

Met Arg Glu 
440 



Arg Trp Ser 



Glu Cya Trp 



Lya Lya Thr 
490 



Tyr lie Met 
395 

Ala Arg Arg 



Tyr Hia Aap 



lie Val Cya 
445 

Ser Aap Glu 
460 

Ala Gin Aan 
475 

Leu Ala Lya 



Ala Aap Met 
400 

Cya Val Ser 
415 

Leu Val Pro 
430 

Met Lya Lya 



Cya Leu Arg 



Pro Ala Ser 
480 

Met Ser Glu 
495 



SUBSTTTUTE SHEET 



wo 94/11502 






PCr/GB93/02367 



80 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



GCGATCCGTC GCAGTCAAAA TTTT 



24 



(2) INFORMATION FOR SEQ ID NO: 21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 
(iii> HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCGGATCCGC GATATATTAA AA6CAA 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGGAATTCTG GTGCCATATA 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



ATTCAAGGGC ACATCAACTT CATTTGTGTC ACTGTTC 



37 



(2) INFORMATION FOR SEQ 10 NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOZXXZY: linear 

(ii) MOLECUU: TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GCGGATCCAC CATGGCGGAG TCGGCC 26 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AACACCGCGC CCGCGATGAT 20 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(V) FRAGMENT TYPE: internal 

(xt) SEQUEKCB DESCRIPTIONS SEQ ID NO: 26 » 

Gly Xaa Gly Xaa Xaa Gly 

1 5 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQXJENCK DESCRIPTION: SEQ ID NO: 27: 

Asp Phe Lya Ser Arg Aen 

1 5 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Asp Leu Lys Ser Lys Asn 
1 5 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Gly Thr Lys Arg Tyr Met 

1 5 
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CLAIMS 

1. An isolated protein having a serine/ threonine kinase 
domain, a DFKSRN or DLKSKN sequence in subdomain VIB and/or 
a GTKRYM sequence in subdomain VIII. 
5 2. A protein according to claim 1, which additionally 
comprises an ATP-binding sequence that is Gly-Xaa-Gly-Xaa- 
Xaa-Gly in subdomain I, and a Lys residue in subdomain II. 

3. An isolated protein having a serine/ threonine kinase 
domain which has more than 50% identity to the kinase 

10 domain of any of the amino-acid sequences identified herein 
as SEQ ID. Nos. 2, 4, 6, 8, 10, 12, 14, 16 and 18. 

4. A protein according to claim 3, wherein the identity 
is more than 60%. 

5. A protein according to any preceding claim, having 
15 serine/ threonine kinase activity. 

6. An isolated protein having all or part of any of the 
amino-acid sequences identified herein as SEQ. ID Nos. 2, 
4, 6, 8, 10, 12, 14, 16 and 18, and activin receptor type 
I functionality. 

20 7. An isolated protein having an amino-acid sequence 
corresponding to part or all of the eunino-acid sequence of 
an activin type I receptor, and wherein the protein has at 
least one of the following characteristics : - 

(i) serine/threonine kinase activity; 
25 (ii) activin-binding activity; and 

(iii) activin type II receptor interaction. 

8. An isolated protein having all or part of any of the 
amino-acid sequences identified herein as SEQ. ID Nos. 2, 
4, 6, 8, 10, 12, 14, 16 and 18, and TGF-B-type I receptor 

30 functionality. 

9. An isolated protein having an amino-acid sequence 
corresponding to part or all of the amino-acid sequence of 
a TGF-6-type I receptor, and wherein the protein has at 
least one of the following characteristics: 

35 (i) serine/ threonine kinase activity; 

(ii) TGF-fl-binding activity; and 

(iii) TGF-B-type II receptor interaction. 
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10. A protein according to any of claims 1 to 5, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No. 2. 

11. A protein according to any of claims 1 to 7, having 
5 all or part of the eunino-acid sequence identified herein as 

SEQ ID No. 4. 

12. A protein according to any of claims 1 to 5, having 
serine/threonine kinase activity and all or part of the 
amino-acid sec[uence identified herein as SEQ ID No. 6. 

10 13. A protein according to any of claims 1 to 7, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No, 8, 

14. A protein according to any of claims 1 to 5, 8 and 9, 
having all or part of the amino-acid sequence identified 

15 herein as SEQ ID No. 10. 

15. A protein according to any of claims l to 5, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No. 12. 

16. A protein according to any of claims 1 to 5, having 
20 all or pairt of the amino-acid sequence identified herein as 

SEQ ID No. 14. 

17. A protein according to any of claims l to 7, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No. 16. 

25 18. A protein according to any of claims l to 5, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No. 18. 

19. A protein acording to any preceding claim, that is a 
soliible receptor. 
30 20. An antibody which binds specifically to a protein as 
defined in any of claims l to 19 and not to at least one 
other such protein. 

21. An isolated nucleic acid molecule which codes for, or 
is complementary to a nucleic acid molecule which codes 

35 for, a protein as defined in any of claims l to 19. 

22. A recombinant nucleic acid molecule comprising at 
least two heterologous sequences, one of which codes for, 
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or is complemen'tary to a nucleic acid molecule which codes 
for, a protein as defined in any of claims 1 to 19. 
23. A molecule according to claim 21 or claim 22, wherein 
the protein is a TGF-B-type I receptor. 
5 24. A molecule according to claim 21 or claim 22, wherein 
the protein is an activin receptor. 

25. A DNA or RNA/mRNA molecule according to any of claims 
21 to 24. 

26. A molecule according to any of claims 20 to 24, which 
10 additionally comprises, operably associated with the coding 

sequence, a sequence adapted to allow expression of the 
protein. 

27. A host comprising a molecule according to claim 26, 
which is capable of expressing the protein. 

15 28. A host according to claim 27, which comprises PAE 
cells. 

29. A host according to claim 27 or claim 28, transfected 
with the Chim A receptor plasmid. 

30. A product according to any preceding claim, for 
20 therapeutic or diagnostic use. 

31. Use of a product according to any of claims 1 to 29, 
for the manufacture of a medicament for use in treating a 
condition associated with TGF activity. 
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cons.aa G 'G G V A K E 

hTGFBR- 1 1 Lim.VGKGRFAEVYKAiajCQirreEQFET\n^VKIIT>YDHYASWKDRia3IF^INIJ^ 

mACCR- IIB LLEIKARGRFGCVWKAQU4N DFVAVKIKPIiQDKQSWOSEREIFSTPGMKHENLIiQF 

mACCR- II LliEVKARGRJFXSCVWKAQlJjN EYVAVKIFPIODKOSWONEYEVYSIPCMKHENIIOF 

da/-l LlxHlVGSGRFGNVSRGDYRG EAVAVKVFNAIDEPAFHKEIEIFETRMLRHPNVUIY 

subdomains i II m iv 



hTGFBR- 1 1 LTAEERKTEIiGKQYVrtilTAFHAKGNIjOEyiiTRHVISWEDLJUn/GSSIJUlGLSHLHSDHTP - C 
mAct R - IIB I AAEKRGSNLEVELWIilTAFHDKGSLIDYliKGNI ITWNELC2fVAElMSRGISYLHEDVPWCR 
mActR-II IGAEKRGTSVDVDIAtflilTAFHEKGSbSDFLKANWSWNELCHIAETOARGIjA 
daf - 1 IGSDRVDTGFVTELWIiVTEYHPSGSIJn)FIXENTVNIETYTlTI24RSTASGIJ^^ 
subdomains v vi-A 



cons.aa dlk n dpg 

hTGFBR- 1 1 -GRPKMPIVHRDIiKSSNILVKNDLTCCLCDPGUSLRI. GPYSSVDDLANSGQVGTARYMTO* 

mACCR- IIB GEGHKPSIAHRDFKSKNVIiLKSDLTAVLADPGLAVRF EPGKPPGD- - THGQVGTRRYMAP 

mACCR - 1 1 - DGHKPAISHRDIKSKNVtiJCNNI/rACIADFGLALJCF EAGKSAGD - - THGQVGTRRYM7U> 

daf - 1 - ESKKPAMAHRDIKSKNIMYKNDLTCAIGDLGLSLSKPEDAASDI I AN - - ENYKCGTVRYLAP 

subdomains vi-B VII VIII 



Fig. 1 
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a. a C C E G N M C 

5' GCGGATCCTGTTGTGAAGGNAATATGTG 3' Fig. 2A 
BAMHl C C G C 



a. a V A V K I F 

5' GCGGATCCGTCGCAGTCAAAATTTT 3' Fig. 2B 

BeunHI G C G G C 

T T T A 



a. a R D I K S K N 

5' GCGGATCCGCGATATTAAAAGCAA 3' Fig. 2C 

BAMHI A C C GTCT 
G A 



a. a EPA MY 

5' CGGAATTCTGGTGCCATATA Fig. 2D 

EcoRI G G G 
A A 
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11 



ActR-ll (513 a.a.) 1 1 I 



11 



ActH-IIB(S36a.a.) ■!- ■- IH^RT 
T8R-II (567 a.a.) ■ I 



TBR-l/ALK-5 (503 a.a.) B| ' ^ ^ 1 ^ 



ALK-1 (503 a.a.) 
ALK-2 (509 a.a.) 



n~~r 



ALK-3(532a.a.) ■ I I ^ VMmm. 



ALK-4 (505 a.a-) 



ALK-6 (502 a.a.) T I ^ ^ 




signal sequence 
cysteine-rich region 
transmembrane domain 
serineAhreonine kinase domain 



insert in the kinase domain 
^ potential N-glycosylation site 
^ alternatively spliced region 



Fig. 4 
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LUDWIG INSTITUTE FOR CANCER RESEARCH et al. 





I. This international preliminary examination report has been prepared by this International Preliminary Examining 
Authority and is transmitted to the applicant according to Article 36. J-reuminary bxamining 

sheets, including this cover sheet. 



applicant according to Article 36 
2. This REPORT consists of a total of 



^ IJln '/rS^nnl'/inH ^"^"'I'Pf ^,y ANNEXES, i.e., sheets of the description, claims and/or drawings which have 

(see\'uT 70 aJd SecSon ?07 of'tJ^V^'P"'' containing rectifications made before this Authoritr 

isee Kuie /u. it> and Section 607 of the Administrative Instructions under the PCT). 

These annexes consists of a total of sheets. 



3. This report contains indications and corresponding pages relating to the following items: 
I Basis of the report 

II Q Priority 

HI ^ Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 
Lack of unity of invention 

^ ^ c^fa^on^l.nH^ll'^f"' T'^^' Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement ^ ' 

VI [[J Certain documents cited 

VII Certain defects in the international application 
VIII Q Certain observations on the international application 
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Date of completion of this report 



t7. 



02.95 



Name and mailing address of the I PEA/ 
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-llll D-80298 Munich 
WJI Tel. ( + 49-89) 2399-0, Tx: 523656 epmu d 
, Fax: (4-49-89) 2399-4465 . 




Form PCT/I PEA/409 (cover sheet) (January 1994) 



Authorized officer 



Telephone Noj 



(30/08/1994) 




INT££tNATIONAL PRELJMINAH EXAMINATION REPORT 



Jntern. application No. 
^CT/GB93/02 367 



I. Basis of tbe report 



1. This report has been drawn up on the basis of (Replacenent sheets which have been furnished to the receiving 
Office in response to an invitation under Article 14 are referred to in this report as "originally filed" and are 
not annexed to the report since they do not contain amendments.): 



the international application as originally filed. 



[X] the description, pages 1-34, 35-82 (sequence listing). 

pages 

pages 

pages 



as originally filed, 
filed with the demand, 
_, filed with the letter of 
_, filed with the letter of 



[x] the claims, Nos. 10-31. 

Nos. 

Hos. 



Hos. 1-9. 
Nos. 



as originally filed, 
_, as amended under Article 19, 

filed with the demand, 
_, filed with the letter of 23/01/95, 

filed with the letter of , 



[x] the drawings, sheets /fig l/8-8/8_ 

sheets/fig 

sheets/fig 

sheets/fig 



., as originally filed, 
_, filed with the demand, 

filed with the letter of 
_, filed with the letter of 



2, The amendments have resulted in the cancellation of: 

[ ] the description, pages 

[ ] the claims, Nos. 



] tbe drawings, sheets /fig 



3. [ ] This report has been established as if (some of) the amendments had not been made, since they have been 
considered to go beyond the disclosure as filed (Rule 70.2(c)): 



4. Additional observations, if necessary: 
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Intern, application No. 
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III. Hon-establislaent of opinion with regard to novelty, inventive step and industrial applicability 

The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious), or 
to be industrially applicable have not been and will not be examined in respect of: 

[ ] the entire international application, 

[X] claims Hos. 1-31 



because: 

[ ] the said international application, or the said claims Mos. — relate 

to the following subject matter which does not require an international preliminary examination (specify): 

[x] the description, claims or drawings (indicate particular elements below) or said claims 

Nos. 1-31 are so unclear that no meaningful opinion could be formed 

(specify): 

Article 6 PCT rec[uires that the claims shall be concise; 
this refers to individual claims as well as to the 
claims in their entirety (see the PCT-Guidelines , CIII, 
5.1). 

Moreover, Article 6 PCT taken in combination with Rule 
6(3) (b) PCT requires that any independent claim must 
contain all the technical features essential to the in- 
vention, i.e. all those features which distinguish the 
claimed subject-matter from sxib ject-matter disclosed in 
the prior art. 

The present invention is set out in six independent 
claims relating to isolated proteins. These six claims 
provide six differently worded, non-analogous defini- 
tions (see Claims 1, 3, 6-9). There is no non-obvious, 
technical feature which is common to all these alterna- 
tive definitions. This presentation makes it impossible 
to determine what the essential technical features are 
of the matter for which protection is sought. Therefore, 



Form PCT/IPEA/409 (sheet 2) (January 1994) 



^Intern, application No. 
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the present set of claims does not meet the 
clarity-requirements of Article 6 PCT. 



[ ] the claiffls, or said claims Nos. ; are so inadequately supported by 

the description that no meaningful opinion could be formed. 

[x] no international search report has been established for said claims 
Nos. 1-31 . 
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IV. Lack of unity of invention 

1. In response to the invitation to restrict or pay additional fees the applicant has: 

[ ] restricted the claims. 

[ ] paid additional fees. 

[ ] paid additional fees under protest. 

[X] neither restricted nor paid additional fees. 

2. [X] This Authority found that the requirement of unity of invention is not conplied with and chose, 

according to Rule 68.1, not to invite the applicant to restrict or pay additional fees; 

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 
13.3 is 

[ ] complied with. 

[X] not conplied with for the following reasons: 

The invention as defined in the claims lacks unity. This 
objection concerns the alternative forms defined in each 
of the 6 independent Claims 1, 3, 6-9 as such as well as 
alternatives referred to in single claims (Claims 1, 8, 
9) . 



For example: 

Claim 1 refers to : 

(a) an isolated protein having a receptor 
serine/threonine kinase domain corresponding to that in 
daf-1 gene product, activin type II receptor and TGF-6 
type II receptor, a DFKSRN sequence in subdomain VIB of 
said domain and a GTKRYM sequence in subdomain VIII of 
said domain. 

(b) an isolated protein having a receptor 
serine/threonine kinase domain corresponding to that in 
daf-1 gene product , activin type II receptor and TGF-6 
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Intern, application No. 
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type II receptor, a DFKSRN sequence in subdomain VIB of 
said domain or a GTKRYM sequence in subdomain VIII of 
said domain. 

(c) an isolated protein having a receptor 
serine/threonine kinase domain corresponding to that in 
daf-1 gene product, activin type II receptor and TGF-6 
type II receptor, a DLKSKN sequence in subdomain VIB of 
said domain and a GTKRYM sequence in subdomain VIII of 
said domain. 

(d) an isolated protein having a receptor 
serine/threonine kinase domain corresponding to that in 
daf-1 gene product, activin type II receptor and TGF-6 
type II receptor, a DLKSKN sequence in subdomain VIB of 
said domain q£ a GTKRYM sequence in subdomain VIII of 
said domain. 

Proteins having a receptor serine/threonine kinase do- 
main corresponding to that in daf-1 gene product, 
activin type II receptor and TGF-6 type II receptor are 
known (see page 3 of the application); therefore, this 
characteristic cannot be regarded as the common inven- 
tive idea linking all the alternatives. 

The remaining features characterizing the claimed prod- 
ucts are in case of 
(a) a DFKSRN and a GTKRYM sequence 
(bl) a DFKSRN sequence 
(b2) a GTKRYM sequence 
(c) a DLKSKN and a GTKRYM sequence 
(di) a DLKSKN 
(d2) a GTKRYM sequence 

It is quite clear from this enumeration that only (b2) 
and (d2) are linked by a common feature whereas the 
other alternatives do not share common features. There- 
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fore. Claim 1 lacks unity - 

Similar considerations apply to Claims 8 and 9 as well 
to the relation between all independent claims. 



It is noted that the proteins of the application, i.e. 
ALK 1-6 appear to share common features; as it is ex- 
plained above, this is however not true for the proteins 
as they are defined in the claims. 

4. Consequently, the following parts of the international application were the subject of international prelininary 
examination in establishing this report: 



[ ] all parts. 

[ ] the parts relating to claims Nos. 
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1. An isolated protein having a receptor serine/ threonine 
Kinase domain corresponding to that in daf-l gene product, 
activxn type II receptor and TGF-/3 type il receptor, a 
DFKSRM or DLKSKN sequence in subdomain VIB of said domain 
and /or a GTKRYM sequence in subdomain VIII of said domain. 

2. A protein according to claim 1, which additionally 
comprises an ATP-binding sequence that is Gly-Xaa-Gly-Xaa- 
Xaa-Gly in subdomain I of said domain, and a Lys residue in 
subdomain II of said domain. 

3. An isolated protein having a GS box and a receptor 
serine/threonine kinase domain which has more than 50% 
identity to the kinase domain of any of the amino-acid 
sequences identified herein as SEQ ID. Nos. 2, 4, 6, 3, 10, 
12 r 14, 16 and 18. 

4 . A protein according to claim 3 , wherein the identity 
is more than 60%. 

5. A protein according to any preceding claim, having 
serine/threonine kinase activity. 

6. An isolated protein having all or part of any of the 
amino-acid sequences identified herein as SEQ. ID Ncs. 2, 
4, 6, 8, 10, 12, 14, 16 and 18, and activin receptor type 
I functionality. 

7. An isolated protein having a GS bojc and an amino-acid 
sequence corresponding to part or all of the amino-acid 
sequence of an activin type I receptor, and wherein the 
protein has serine/threonine kinase activity and/or activin 
type II receptor interaction providing activin-binding 
activity. 

8. An isolated protein having a GS box and all or part of 
any of the amino-acid sequences identified herein as SEQ. 
ID Nofi. 2, 4, 6, 8, 10, 12, 14, 16 and 18, and TGF-fl-type 
I receptor functionality. 

9. An isolated protein having an amino-acid sequence 
corresponding to part or all of the amino-acid sequence of 
a TGF-fi-type I receptor, and wherein the protein has 
serine/ threonine kinase activity and/or TGF-fl-type II 
receptor interaction providing TGF-jS-binding activity. 



AMENDED SHEET 



PCT/GB93/02367 



PATENT COOPERATION TREATY 

# 



From the INTERNATIONAL BUREAU 



PCT 

NOTIFICATION CONCERNING 
DOCUMENT TRANSMITTED 


To: 

United States Patent and Trademark 

Off irp 
(Box PCT) 

Washington D.C. 20231 
United States of America 

in Its capacity as elected Office 


Date of mailing: 

21 March 1995 (21.03.95) 




International application No.: 

PCT/G 893/02367 


International filing date: 

17 November 1993 (17.11.93) 


Applicant: 

LUDWIG INSTITUTE FOR CANCER RESEARCH et a! 




The International Bureau transmits:herewith the"f6llowing documents and number thereof: 



copy of the.internafional preliminary examination report and annexes (Article 36(3)(a)) 





The International Bureau of WlPO 
34, chemin des Colombettes 
121 1 Geneva 20, Switzerland 

Facsimile No.: (41-22)740.14.35 


Authorised officer: 
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^ikxENT COOPERATION TREAi 

PCT 
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PCT 
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Applicant's or agent's file reference 
70/4201/03 
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International application No. 
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This international preliminary examination report has been prepared by this International Preliminary Kxamining 
Authority and is transntilied to the. applicant according to Article 36. 



This REPORT consists of a total of ^ 



sheets, including this cover sheet. 



This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCZT). 



These annexes consists of a total of 



sheets. 



3. This report contains indications and corresponding pages relating to the following items: 

I fx] Basis of the report 
II Priority 

HI Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

IV [ I Lack of unity of invention 

V [ I Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement — 

VI I I Certain docoments cited 

VII [ I Certain defects in the international application 
VIII I I Certain ob.servations on the international application 
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INTERNATIONAL PRELIMINlBfy EXAMINATION REPORT 



|Intern. application No. 
PCT/GB93/02367 



I. Basis of the report 



1. This report has been drawn up on the basis of (Replaceient sheets which have been furnished to the receiving 
Office in response to. an invitation under Article 14 are referred to in this report as "originally filed" and are 
not annexed to the report since they do not contain amenduents.): 



the international application as originally filed. 



[X] the description, pages 1-34, 35-82 (sequence listing). 

pages 

pages 

pages 



., as originally filed, 
_, filed with the demand, 
_, filed with the letter of 
_, filed with the letter of 



[X] the claims, Kos. 10-31. 

Nos. 

Hos. 



Nos. 1-9. 
Nos. 



as originally filed, 
_, as amended under Article 19, 
_, filed with the demand, 
_, filed with the letter of 23/01/95, 
_, filed with the letter of , 



[X] the drawings, sheets/fig 1/8-8/8. 

sheets/fig 

sheets/fig 

sheets/fig 



as originally filed, 
_, filed with the demand, 
_, filed with the letter of 
_, filed with the letter of 



2. The amendments have resulted in the cancellation of: 

[ ] the description, pages 

[ ] the claims, Hos. 

[ ] the drawings, sheets/fig 



3. [ ] This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

4. Additional observations, if necessary: 
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III. Ron-establishient of opinion with regard to novelty, inventive step and industrial applicability 



The questions whether the claioed invention appears to be novel, to involve an inventive step (to be non-obvious), or 
to be industrially applicable have not been and will not be examined in respect of: 

[ ] the entire international application) 



[x] claims Nos. 1-31. 



because: 

[ ] the said international application, or the said claims Nos. relate 

to the following subject matter which does not require an international preliminary examination (specify): 

[X] the description, claims or drawings (indicate particular elements below) or said claims 

Nos. 1-31 are so unclear that no meaningful opinion could be formed 

(specify): 

Article 6 PCT requires that the claims shall be concise; 
this refers to individual claims as well as to the 
claims in their entirety (see the PCT-Guidelines , CIII, 
5.1). 



Moreover, Article 6 PCT taken in combination with Rule 
6(3) (b) PCT requires that any independent claim must 
contain all the technical features essential to tITe in- 
vention ^ i.e. all those features which distinguish the 
claimed subject-matter from subject-matter disclosed in 
the prior art. 

The present invention is set out in six independent 
claims relating to isolated proteins. These six claims 
provide six differently worded, non-analogous defini- 
tions (see Claims 1, 3, 6-9). There is no non-obvious, 
technical feature which is common to all these alterna- 
tive definitions. This presentation makes it impossible 
to determine what the essential technical features are 
of the matter for which protection is sought. Therefore, 
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the present set of claims does not meet the 
clarity-requirements of Article 6 PCT. 



[ ] the claims, or said clains Nos. are so inadequately supported by 

the description that no leaningful opinion could be formed. 

[ ] no international search report has been established for said claims 
Nos. 
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1. An isolated protein having a receptor serine/ threonine 
kinase domain corresponding to that in daf-1 gene product, 
activin_type IX receptor and TGF-/3 type II receptor, a 
OFKSRN or DUCSKN sequence in subdomain VIB of said domain 
and/or a GTKRYM sequence in subdomain VIII of said domain. 

2. A protein according to claim 1, which additionally 
comprises an ATP-binding sequence that is Gly-Xaa-Giy-Xaa- 
Xaa-Gly in subdoraain I of said domain, and a L,ys residue in 
subdomain II of said domain. 

3. An isolated protein having a GS box and a receptor 
serine/threonine kinase domain which has more than 50% 
identity to the kinase domain of any of the amino— acid 
sequences identified herein ~s: SEQ ID. Nos. 2, 4, 6, 8, 10, 
12 r 14, 16 and 18. 

4 . A protein according to claim 3 , wherein the identity 
is more than 60% . 

5. A protein according to any preceding claim, having 
serine/threonine kinase activity. 

6. An isolated protein having all or part of any of the 
amino-acid sequences identified, herein as SEQ. ID Nos. 2, 
4, 6, 8; 10, 12, 14, 16 and 18, and activih receptor type 
I functionality. 

7. An isolated protein having a GS box and an amino^*acid 
sequence corresponding to part or all of the amino^cid 
sequence of an activin type I receptor, and wherein the 
protein has serine/threonine kinase activity and/or activin 
type II receptor interaction providing activin-binding 
activity. 

8. An isolated protein having a GS box and all or part of 
any of the amino-acid sequences identified herein as SEQ. 
ID Nos. 2, 4, 6, 8, 10, 12, 14, 16 and 18, and TGF-fi-type 
I receptor functionality. 

9. An isolated protein having an amino-acid sequence 
corresponding to part or all of the amino-acid sequience of 
a TGF-S-type I receptor, and wherein the protein has 
serine/ threonine kinase activity and/or TGF-B-type II 
receptor interaction providing TGF-^S -binding activity. 
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United States Patent and Trademark 
Office 
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in Its capacity as elected Office 


Date of mailing: 

Z 1 July iyy4 (^1.07.94) 


International application No.: 

PCT/GB 93/02367 


Applicant's or agent's file reference: 
70/4201/03 


International filing date: 

17 November 1993 (17.11.93) 


Priority date: 

17 November 1992 (17.11.92) 


Applicant: 

MIYAZONO, Kohei et al 



1. The designated Office is hereby notified of its election made: 

in the demand filed with the International Preliminary Examining Authority on: 



X 



09 June 1994 (09.06.94) 



I I in a notice effecting later election filed with the International Bureau on: 



2. The election | X | was 

□ 



was not 



made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 



The International Bureau of WlPO 


Authorized officer: 




34, chemin des Colombettes 






1211 Geneva 20. Switzerland 


B. Schmitt 




Facsimile No.: (41-22)740.14.35 


Telephone No.: (41-22)730.91.11 




Form PCT/IB/331 (July 1992) 




496650 



REQUEST 



p0r 



The undersigned rwjuesu ihtt the present 

international application be processed 
according to the Patent Cooperation Treaty. 



rcc«i*«ni Offk* u$« only 



Iniertvaiional Filim Dau 



Name of receivinf Office and "PCT tniemaiional Applicatioo' 



Applicant's or a|em'i file rererence 
(ifdtiirtd) (12 tharaettn muamumi 



70/4201/03 



Box No. I TfTLE OF INVENTION 

PROTEINS HAVING SERINE/THREONINE KINASE DOMAINS, CORRESPONDING 
NUCLEIC ACID MOLECULES. AND THEIR USE 



Bex No. II APPLICANT 



Name and addreis: iFamilx namthfto^td by |/v«i *am*: foraUtai win. M effidal 
name ana 'j^^j^^ig^ jIuoddrtumSst include postdcodt and nam of COmtry.} 

LUDWIG INSTITUTE FOR CANCER RESEARCH 

St. Mary's Hospital Medical School 

Norfolk Place 

Paddington 

London W2 IPG 

United Kingdom 



I I This person is also inventor. 



Telephone No. 



Facsimile No. 



TekprimcrNo. 



Suie (Le. country) of naiioniliiy: 
United Kingdom 


Sute (It. country) of res 
United King 


idence: 
dom 




Box No. Ill FURTHER APPLICANTS AND/OR (FURTHER) INVENTORS 





Name and address: ff<wu7v name followed by f ivtn name: for a utaienitn. m cspaai 
r.anic ano •oorw*. '^j,-^,,^^*^. The oddreu misi include postal codt and name cf country.) 

MIYAZONO, Kohei 
Flogstavagen 63D 
S-752 63 Uppsala 
Sweden 



This person is: 
j I applicant only 

|X j ipplicant and inventor 

I I invemoronly f(^rt«c**ft-lttt 
*— • is marked, do not fill in below.) 



Suie ftc country) of nationality: 
Japan 



State fte. country; of lesidence: 
Sweden 



This person is applicant 
for ihe purposes of: 



□ all ddiinaied n iei^ii^iti Slates except 
* I ItheUnittdSiaiwofAmenca 



r--| the United States 
1 X| of Amertca only 



□ the Sutes iikJicaurf ia 
the Supplcmewal Boa 



Name and address: (Famih name Mowed tv |fvrii name: for eUtal entity. MjO^tjoi 
tunc ana aooress. J^,:^;^,;^ address mist include postal code end name ofctmoy.) 

DUKE, Peter ten 
Flogstavagen 25C 
S-752 63 Uppsala 
Sweden 



This person is: 
I I epplictnt only 

applicant and inventor 

r~l inventor only Of this ched-hot 
UmartetLdfi not fill in below.) 



State (i.e. luiiiury) of nationality: 

Netherlands 



Sute (i.e. countn) of (tsidence: 
Sweden 



Thtii pervon is applicant 
for the p\irposes of: 



□ all dc»ignaie<5 I — I all <J"i.l Sutei ewep 
Jute* I J th« United StaiM of Amcnca 



aihc t'niird States 
of America only 



□ 



the Siaie« indicsteJo 
(he Suppteinenul 6.-t 



Continuation of Bos No. 



ShwNo. ..... 

RTHER APPUCANTS ANDADR (Fim 



4vroRS 



If MM ofthtfoUomni tu^boxtt U uttd, this thttt is not u b4 imetudti in iht nqutn. 



Name and adOrt*»: (Fanily itamtMlo^td by wtt: for « /»|o/ tntin. /W' 

FRAN2EN, Petra 
Lindsbergsgatan 15b 
S-752 40 Uppsala 
Sweden 



Thi* person b: 
{ I applicant only 

apptksM and inventor 

I I inventor only (If this chtti-bot 
' if morUd. 4o not fill in btlow.) 



1 State <Lt. ceuiuryi of naiionaliiy: 
1 Sweden 


Siatt (it. twniry) of residence: 

Sweden 


1 ThU person is applicaa | — 1 alt (mipaud I — 1 all dwiputerf $ut« wct^ rri 
1 for the purposes oft LJ Staw LJ «»« S^**" Amenca ULIof 


United Sums r~1 ihc Suies indicated ia 
America only 1 i (he Supplcnienial Boa 



Name and address: (Famih HcnuMowtd by iiven nam*; fi>r a Uioltntiry. M officicl 
dtsitnhtion. Thtoddreu must include postal code and namtefeouury.l 

YAMASHITA, Hidetoshi 

Flogstavagen 3 3A - ' 

S-752 63 Uppsala 
Sweden 



State (Le. ccu/urj) of nationality: 

Japan 



This person is: 
[~] applicam only 

j ^ applicant and inventor 

I j inventor only (If this check-box 
it marktide not fill in below.) 



State (Le. country) of residence: 

Sweden 



□ the Uniied Stales 
of America only 



□ tfte States indicated ia 
the Supplemental Boi 



Name and address: (Family nameMowed by |n»n name: for « legal entiry, M officii 
i—s^ Jfci,|nfl/io«. The address mutt include postal code and name cfcowiuy.) 

HELDIN, Carl-Henrik 

Hesselmans vag 35 

S-752 63 Uppsala 

Sweden 



This person ic 
I I applicant only 

I X] applicant and inventor 

I j inventor only (If this check-box 
' a marked, do not Jill in below.) 



State fi-e. country) of nationality: 

Sweden 



State (Le. country) of residence: 
Sweden 



This oersoo is aoolioDt I — I all detiinaied f — I all de»i|n»»ed Suiet eieeyt P-l ** United Staua p-l the States indieattdm 
for tlS^puip<»«% risaSt *^ LJihe Uniied Sum of Aninca | XJ cf Amenca only LI Supplemental Boi 



Name and address (Family nameMlo*ed by tivtn name: for a Uialemity. Mfpcj^ 
name ang aoorew; Jfc,/|„S,i«i fhe address must include posud code and namt of ceuitry.) 



This person is: 

I I applicant and inventor 

rn inventor only (If this cheek-box 
is marked io not fiUinbelffo'.) 



State lie. couniryi of luiionality: 



[State lie. counin) of residence: 



i. person is applicant ni"^"*-*- Q f^S'sl^A'a □ STA^nS^.^^ □ 
• the purposics 01; i— * «-~» * m , , , 



purposics 



j— ] Funhcr applicants and/or (further) inventors ore indicated on another continuation sheet 



Furm PCT/RO/IOI (cominuaiion iheet) (July 1992) 



See S. :ts to the request form 



:N«.IV ACDTTOI 



mi 



MON UrftESOrTAllVti OK AD 



COUtlSrONDCNCI 



' of OMtpplicaA^i) tefoN Um co«p>iMt lftWM><»*« A*>oritt« m: 



GILL JENNINGS & EVERY 
Broadgate House 
7 Eldon Street 
London EC2M 7LH 
United Kingdom 



071 377 1377 



PHdaOtHi. 

^71 377 1310 



TtlcprtoivfN*- 

22765 



r-n M.rk thi. Aect bo* whm i>o ty nt or coeamon reprynUliv, i»A«t btet .ppoimed <txHhe »p>ce above U uk4 io^ Id 
n iridiatc «tped«l tddreo to whfch cowpoodeftce ihould be teat. 



Bos No.V DESIGNATION OP STATES 



•n* fono^Bi dciiMUoo. M bmby oud« Rule 4.9(.) (m,fi A, 4fpi»bU * ««rfa 

Reftloful PatMM , 
Ivl EP EuTODMnPttent ATAwtri«,BEBel|ium.CH««IUSwii*eH«ndindUeehte«ie.D.DEC«n^ 

NL NeiberitDdi, PT Portugil. SE Sweden, and »ny otbe» Sute whicb b * Cootrtdini Suie of the Eurapun Pate« 
Coaventioo end of ibe PCT 

□ OA OAPIPHent: Benin.BurldBtFMo.Omefoo«,Ceninl AfrkenRep«blic.ai»d.COo|<^ated1v« 
M.li. MiuHudU, Ni«er, Sene|il. Tojo. and my other Suie which U i member Suie of OAH tad . CootrKrbni SUK 

of the per Of other tuU t(pniec6<m or Umtmen ienrti, tpmjjr f» Sotted Bm) 

Nattoittt PtUnt pfcdter UaJ cfprcudion cr Iretmaa Jairei. ipmf, m ieOed limi: 



Q AT Austria 

[x] AM Australia 

Q BB Barbadot 

n BG Bulgaria 

□ BR Braiil 

□ BY Belawa 
ET) CA Canada 

n CHandU Switzerlaod and Uechtensleia 

n CZ Qxcfa Republic 

n DE GenDaojr 

Q DK Dettmarfc 

Q ES Spain 

□ PI Fiotand 

Q GB United KiDgdoa 

Q HU Huogaiy 

(3 JP Japan 

n KP Democratic People's Republic of Korea 



□ 
□ 
□ 
□ 

E] 
□ 
□ 
□ 
□ 
□ 
□ 
□ 
□ 



MN MoogolU 

MW Malawi 

NL Nelberlands 
NO Norway 

NZ New Zealand 

PL Poland 

PT Portugal 

RO Rooaoia 

RU Russian Fedeiatioo 

SD Sudan 
SE Swedes 

SK Slovakia ■ 

DA t^kraine 

US Uuted Su les of America 



□ VN Viet Nam 



[x] KR Republic of Koren 

Q KZ KazaUisUa ... 

Q LK SriUnka 

Q LU Luxembourg ... 

Q MG Madagascar 



Cbeck-boxca reserved for desigoatiag Suies (for the purpoca of 

at pa " " 

}uuaDoe of this sheet: 



a natioQat paleot) wh!di have beome party to 6e' PCX afkr 



□ 
□ 
□ 
□ 



|n»ddlUontothedestgT.«tk>mm«dcaboY«.theapplica»t»!»om»k«s»nderRute4.9(b)andeai gn.tio^^ 

jjndei the PCT except ;>>t,<i?ig"igg?f^,Y ^;-^ 

it) 




Form PCr/RO/lOl (secood sheet) (July lW3) 



5<t Sotes tc iKt revest firm 



Boi N«. VI PRIO 



llowiB^aflitr a 



The priority of the followif>^ar1itf apylka<ion<») ti hmby etaimed: 



Funhcf prioriry ctai^^ indiqicd in tht Supplcnwnul ftw (x] 



Country 
(m %-hkK at for %kkK iht 
apptieaion maj ftitdl 



iiem ( I ) 



GB 



Ftlins Date 



17.11.92 



ApplicaiioA No. 



9224057.1 



OfTtcc of filing 
ionlxfofrtiiemalcr 



item (2) 



GB 



08.03.93 



9304677.9 



iieni (3) 



GB 



08.03.93 



9304680.3 



Mark the follo^t chtci-bcxJAe ctni/itdeopy efthe tariitr apptieatiai is»ht issued bt ike Offke *Mi for Oit ptirpottt of On prttttu mtnaauiel 
app(tcaiionatkertcfivintOJpce(a/ttmajbtrtquirtd}: t r- t r 



Th« receivinf OfTict is hereby requested to prepare and transmit to the Iniemational 
Bureau a cenined copy of the earlier appIicatiorKs) identified above as item(s) : 



Box No. Vn EARLIER SEARCH 



fii/ in ***r< a seanh (iiutmetional. iiairttational-iype or other} by the hunationol Seanhini Authoritv has olrtads been nrritiwn or rtauaied md die 
Aujhonty u now requtsied a base tht Mtrtuiianal search, te the eiieni oossibU, <m the results cf that earlier uarck IdemiKsuch uorth or noueu tahetbt 
rejtrenee to tht rtlevaiaepplteaionior the traialaitOTi thereof} or by rtfettiKe»0»seatKkrtqii^ ^ 

Country (or regional OfTice): Date (day/tnonthfyear): 



Nuiober 



Box No, VIII CHECK LIST 



This international application contains 
the following number of sheeu: 



1. request 

2. descripdoa 

3. claints 

4. abstract 
3. drawings 

Total 



5 sheets 

82 jheets 

3 sheets 

1 sheeu 

8 sheets 



99 sheets 



This interrwional appticauon is accompanied by the item(s) marked below: 

2. n **>Py *f fe'>««l « n separate indications concerning 
I I power of attorney ^' i i deposited sucroorganisms 

3. ! I f**"?^ explaining - | — | oucleottde and/or amino acid 
I I lack of signanue '* I I sequence listing (diskette) 

*• □S3iS^.^r«Kf other fxpe«»;: 
as iiem(s): 



Ftgurc No. 



of the drawings (if any) should accompany the abstract when it is published. 



Box No. IX SIGNATURE OF APPLICANT OR AGENT 



Nta 10 each si'tnaturt, indJau tht name of ihe person siinint and the eapociiy i» which the perm ii| v i4 s*ch capacilj is not obvio*tfrom reodittt the tt quest}. 

GILL JENNINGS & EVERY 
Agents for the Applicants 




J. 


Date of actual receipt of the purported 

international application: 


l-of fecej 




2. Drawings: 


3. 


Correcied date of actual receipt due to later but 
timely received papers or drawings completing 
the purported iniemational application: 






1 1 received: 


4. 


Date of timely receipt of the required 
corrections under PCT Article 1 1(2): 






(~] not received: 


5. 


Internationa] Searching Authority fc a / 
specifled by the applicant l^A / 




6. pn Transmittal of search copy delayed 
1 — 1 until search fee is paid 





For International Bureau use only . 



Djie of receipt of the record copy 
by the Intenuiional Bureau: 



Fonn PCT/RO/IOI (last sheet) (January 1993) 



Hee Sities to the request jortn 



Sheet No. 



// in any of Iht Bextt, tht tpaet is insufpeiint to 
furnish all iht information: 

in particular: 

(i) if mart than ihret persons art involvtd as applicants 
and/or invtniors and no 'coniinuation shtti' is 
a^wlablt: 

(ii) if. in Box No. 11 or in any of iht sub-boxts of Box 
No. III. iht indication 'iht Statts indieattd m tht 
Suppltmtnial Box' is chtcktd: 



U 'li) if. in Box So. 11 or in any of tht sub-boxts of Box 
No. Ill, the invtntor or ilit invtnior/apptieani is not 
inventor for tht purposes of all dtsignattd Statts or 
for the purposts of the Unittd Statts of America: 

(iv) if. in addition to tht agentis) indicated in Box No. /V, 
there are further agents: 

(v) if in Box No. V. the name of any Slate (or OAPl) is 
accompanied by the indication paitnt of addition. ' 
'ctrtiftcatt of addition.' or "invtnior's ctrtificatt 
of aadiiion, ' or if. in Box No. V, the name of the 
Unittd Statts of Amtrica is accompanied ty on 
indication 'Continuation' or 'Continuation-in- 
part': 

(vi) if there are more than three earlier applications 
whose priority is ctaimeiL 



Supplemental Box If th^g^pltmintal Box is not used, this shttt nttd not Ij^^luded in tht rtqunt 



>eSSts 



L'st this box in tht following iSsis 



I. 



*. If tht applicant claims, in rtspect of any dtsignattd 
Office, the benefits of provisions of iht national law 
concerning non-prejudicial disclosures or exceptions to 
lack of novtlty: 



Additional Priority Applications 



lA such east, writt "Continuation of Box So. -"lindieait tht number 
of tht Boxj and furnish tht information in the same manner as 
riquirtd according to tht captions of tht Box in which tht space was 
insufficient: 

in such cast, writt 'Continuation of Box No. Ill' and indicatt for tach 
additional ptrson tht samt rvpt of information as rtquirta in Box 
No. Ill; 

in such cast, writt 'Continuation of Box No. 11" or 'Continuation of Box 
No. Ill' or 'Continuation ofBoxts No. II and No. Ill' (as tht cast may 
bt). indicatt the name of tht applicant^) involved and. nr.n to (each) 
such name, tlie Slate or States (and/or. where applicable. European or 
OAPl patent) for the purposts of which tht namtd person is applicant: 

in such east, write 'Continuation ef Box No. War 'Continuation of Box 
No, III' or "Coniinuation ofBoxts No. II and No. Ill' (as tht cast may 
bt), indicatt tht name of tht invtntorisj and, ntxt to (tach) such name, 
tht Siati or Statts (and/or, whtrt applicablt. Europtan or OaPI paitnt) 
for tht purposes of which the named person is inventor: 

in such ease, write 'Continuation of Box No. IV and indicate for each 
further agent the same type of information as required in Box No. IV: 

in such case, write 'Continuation ef Box No. V and the name of each 
Slate involved (or OAPI). and after the name of each such State (or 
OAPI), the number of the parent title or parent application and the date 
of grant of the parent title or filing oflne parent application; 



in such ease, write 'Continuation of Box No. VI' and indicate for each 
additional earlier application the same type of information as required 
in Box No. VI, 

in such case, write 'Statement Concerning Non-Prejudicial Disclosures 
or Exceptions to Lock of Novelty' and fitmish thai statenunt below. 



(4) GB 

(5) GB 

(6) GB 

(7) GB 



28.05.93 
02.07.93 
03.08.93 
15.10.93 



9311047.6 
9313763.6 
9316099.2 
9321344.5 



Form PCT/RO/IOI (supplemenial sheet) (July 1992) 



See Soles tit the r«que.\i Jn^n 



